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NANBV DIAGNOSTICS AND VACCINES 

Technical Field 

The invention relates to materials and 
methodologies for managing the spread of non-A, non-B 
hepatitis virus (NANBV) infection. More specifically, it 
relates to polynucleotides derived from the genome of an 
etiologic agent of NANBH, hepatitis C virus (HCV), to 
polypeptides encoded therein, and to antibodies directed 
to the polypeptides. These reagents are useful as screen- 
ing agents for HCV and its infection, and as protective 
agents against the disease. 
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Background Art 

Non-A, Non-B hepatitis (NANBH) is a 
transmissible disease or family of diseases that are 
believed to be viral-induced, and that are distinguishable 
from other forms of viral-associated liver diseases, 
including that caused by the known hepatitis viruses, 
i.e., hepatitis A virus (HAV), hepatitis B virus (HBV), 
and delta hepatitis virus (HDV), as well as the hepatitis 
induced by cytomegalovirus (CMV) or Epstein-Barr virus 
(EBV). NANBH was first identified in transfused 
individuals. Transmission from man to chimpanzee and se- 
rial passage in chimpanzees provided evidence that NANBH 
is due to a transmissible infectious agent or agents. 

Epidemiologic evidence is suggestive that there 
may be three types of NANBH: the water-borne epidemic 
type; the blood or needle associated type; and the 
sporadically occurring (community acquired) type. 
However, the number of agents which may be the causative 
of NANBH are unknown. 

Clinical diagnosis and identification of NANBH 
has been accomplished primarily by exclusion of other 
viral markers . Among the methods used to detect putative 
NANBV antigens and antibodies are agar-gel diffusion, 
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counterimmunoelectrophoresis, immunofluorescence 
microscopy ^ electron nicroscopyi radioimnunoa 

and ensyme-l ln ked immunosorbent assay. However, none of 
these essays has proved to he sufficiently sensitive, 
for I^bh - «- — as a diagnostic test 

Previously there was neither clarity nor agree- 

" Zih 3 : " iden " ty " °*°°«^*Y ot the antigen 
antxbody systems associated with agents of U ,„. 9 This 

Sibil t ,H 81 additiM ' ^ POS- 

srbrlrty that NANBH is caused by more than one infectious 
agent, as well as the possibility that NANBH has been mis- 
diagnosed. Moreover, it is unclear what the serological 
assays detect in the serum of patients with NANBH. it has 
been postulated that the agar-gel diffusion and counter 
Immunoelectrophoresis assays detect autoimmune responses 

betZ SPeC " iC Pr0t9i " ^at sometimes occur 

between serum specimens, and that they do not represent 
25 specific NANBV antigen-antibody reactions. The 

immunofluorescence, and enzyme-l iIlk ed immunosorbent, and 
radioimmunoassays appear to detect low levels of a 
rheumatoid-factor-lifce material that is frequently present 
in the serum of patients with NANBH as well as in patients 
wrth other hepatic and nonhepatic diseases. Some of the 
reactivity detected may represent antibody to host! 
determined cytoplasmic antigens. 

There have been a number of candidate NANBV. 
See for example the reviews by Prince (1983), Feinstone 
35 and Hoofnagle ,1984,, and Overby ,1985, 1986 1987, and 

the article by Iwarson (1987). However, there is no proof 
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that any of these candidates represent the etiological 
agent of NANBH. 

The demand for sensitive, specific methods for 
screening and identifying carriers of NANBV and NANBV 
contaminated blood or blood products is significant. 
Post-trans fusion hepatitis (PTH) occurs in approximately 
10% of transfused patients, and NANBH accounts for up to 
90% of these cases. The major problem in this disease is 
the frequent progression to chronic liver damage (25-55%). 

Patient care as well as the prevention of 
transmission of NANBH by blood and blood products or by 
close personal contact require reliable screening, 
diagnostic and prognostic tools to detect nucleic acids, 
antigens and antibodies related to NANBV. in addition, 
there is also a need for effective vaccines and 
immunotherapeutic therapeutic agents for the prevention 
and/or treatment of the disease. 

Applicant discovered a new virus, the Hepatitis 
C virus (HCV), which has proven to be the major etiologic 
agent of blood-borne NANBH ( BB -NANBH ) . Applicant's 
initial work, including a partial genomic sequence of the 
prototype HCV isolate, CDC/HCV1 (also called HCV1), is 
described in EPO Pub. No. 318,216 (published 31 May 1989) 
and PCT Pub. No. WO 89/04669 (published. 1 June 1989). The 
disclosures of these patent applications, as well as any 
corresponding national patent applications, are in- 
corporated herein by reference. These applications teach, 
inter alia, recombinant DNA methods of cloning and 
expressing HCV sequences, HCV polypeptides, HCV 
immunodiagnostic techniques, HCV probe diagnostic 
techniques, anti-HCV antibodies, and methods of isolating 
new hCV sequences, including sequences of new HCV 
isolates . 
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Disclosure o f the Invention 

HCV seCTue Pr ! Sent inV6nti0n is bas ^, in part, on new 

5 I ^ SnCeS and P^ypeptides that are not disclosed in 
EPO Pub. No. 318,216, or in PCT Pub. No. WO 89/04669 
included within the invention is the application of ihese 
new sequences and polypeptides in, inter alia 
immunodiagnostics, probe diagnostics, anti-HCV antibody 
production, PC R technology and recombinant DNA technology, 
included within the invention, also, are new immunoassay^ 
based upon the immunogenic ity of HCV polypeptides 
disclosed herein. The new subject matter claimed herein, 
while developed using techniques described in, for 
^ example, EPO Pub. No. 318,216, has a priority date which 
antecedes that publication, or any counterpart thereof. 
Thus, the invention provides novel compositions and 

Int^"^ 1 SCreenin * sam P les *°* HCV antigens and 

antxbodies, and useful for treatment of HCV infections. 

Accordingly, one aspect of the invention is a 
recombinant polynucleotide comprising a sequence derived 
from HCV cDNA, wherein the HCV cDNA is in clone 13i or 

clZ ° r Cl ° ne 59a ' ° r Cl ° ne 84a ' « Clone or 

clone 167b, or clone pil4a, or clone CA216a, or clone 

25 CA290a, or clone ag30a, or clone 205a, or clone 18g, or 
clone 16jh, or wherein the HCV cDNA is of a sequence 
indicated by nucleotide numbers -319 to 1348 or 8659 to 
8866 in Fig. 17. 

Another aspect of the invention is a purified 
polypeptide comprising an epitope encoded within HCV cDNA 
wherexn the HCV cDNA is of a sequence indicated by 
nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 



20 



30 



Yet another aspect of the invention is an im- 
35 munogenic polypeptide produced by a cell transformed with 
a recombinant expression vector comprising an ORF of DNA 
derxved from HCV cDNA, wherein the HCV cDNA is comprised 
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of a sequence derived from the HCV cDNA sequence in clone 
CA279a, or clone CA74a, or clone 13i, or clone CA290a, or 
clone 33C or clone 40b, or clone 33b, or clone 25c, or 
clone 14c, or clone 8f, or clone 33f, or clone 33g, or 
clone 39c, or clone 15e, and wherein the ORF is operably 
linked to a control sequence compatible with a desired 
host . 

Another aspect of the invention is a peptide 
comprising an HCV epitope, wherein the peptide is of the 
formula 



AA x -AA y , 

wherein x and y designate amino acid numbers shown in Fig. 
17, and wherein the peptide is selected from the group 
consisting of AA1-AA25, AA1-AA50, AA1-AA84, AA9-AA177, 
AA1-AA10, AA5-AA20, AA20-AA25, AA35-AA45, AA50-AA100, 
AA40-AA90, AA45-AA65, AA65-AA75, AA80-90, AA99-AA120, 
AA95-AA110, AA105-AA120, AA100-AA150, AA150-AA200, 
AA155-AA170, AA190-AA210, AA200-AA250, AA220-AA240, 
AA245-AA265, AA250-AA300, AA290-AA330, AA290-305, 
AA300-AA350, AA310-AA330, AA350-AA400, AA380-AA395, 
AA405-AA495, AA400-AA450, AA405-AA415, AA415-AA425, 
AA425-AA435 , AA437-AA582, AA450-AA500, AA440-AA460, 
AA460-AA470, AA475-AA495, AA500-AA550, AA511-AA690, AA515- 
AA550, AA550-AA600, AA550-AA625, AA575-AA605, AA585-AA600, 
AA600-AA650, AA600-AA625, AA635-AA665, AA650-AA700, 
AA645-AA680, AA700-AA750, AA700-AA725, AA700-AA750, 
AA725-AA775, AA770-AA790, AA750-AA800, AA800-AA815, 
AA825-AA850, AA850-AA875, AA800-AA850, AA920-AA990, 
AA850-AA900, AA920-AA945, AA940-AA965, AA970-AA990, 
AA950-AA1000, AA1000-AA1060 , AA1000-AA1025 , AA1000-AA1050 , 
AA1025-AA1040, AA1040-AA1055 , AA1075-AA1175 , 
AA1050-AA1200, AA1070-AA1100 , AA1100-AA1130 , 
AA1140-AA1165, AA1192-AA1457 , AA1195-AA1250 , 
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AA1200-AA1225, AA1225-AA1250, AA1250-AA1300 , 
AA1260-AA1310, AA1260-AA1280, AA1266-AA1428 
AA1300-AA1350, AA1290-AA1310, AA1310-AA1340 , AA1345- 
AA1405, AA1345-AA1365, AA1350-AA1400, AA1365-AA1380 
AA1380-AA1405, AA1400-AA1450, AA1450-AA1500 , 
AA1460-AA1475, AA1475-AA1515 , AA1475-AA1500 ,' 
AA1500-AA1550, AA1500-AA1515, AA1515-AA1550 
AA1550-AA1600, AA1545-AA1560, AA1569-AA1931 ' 
AA1570-AA1590, AA1595-AA1610, AA1590-AA1650 ', 
AA1610-AA1645, AA1650-AA1690 , AA1685-AA1770 , 
AA1689-AA1805, AA1690-AA1720, AA1694-AA1735 ' 
AA1720-AA1745, AA1745-AA1770, AA1750-AA1800 
AA1775-AA1810, AA1795-AA1850, AA1850-AA1900 , 
AA1900-AA1950, AA1900-AA1920, AA1916-AA2021 ' 
AA1920-AA1940, AA1949-AA2124 , AA1950-AA2000 
AA1950-AA1985, AA1980-AA2000 , AA2000-AA2050 , 
AA2005-AA2025, AA2020-AA2045 , AA2045-AA2100 
AA2045-AA2070, AA2054-AA2223 , AA2070-AA2100 ,' 
AA2100-AA2150, AA2 150-AA2200 , AA2200-AA2250,' 
AA2200-AA2325, AA2250-AA2330 , AA2255-AA2270 ' 
AA2265-AA2280, AA2280-AA2290 , AA2287-AA2385 , 
AA2300-AA2350, AA2290-AA2310 , AA2310-AA233o! 
AA2330-AA2350, AA2350-AA2400 , AA2348-AA2464 ,' 
AA2345-AA2415, AA2345-AA2375 , AA2370-AA2410 
AA2371-AA2502, AA2400-AA2450 , AA2400-AA2425 , 
AA2415-AA2450, AA2445-AA2500 , AA2445-AA2475 ,' 
AA2470-AA2490, AA2500-AA2550, AA2505-AA2540 ' 
3Q AA2535-AA2560, AA2550-AA2600 , AA2560-AA2580 ,' 
AA2600-AA2650, AA2605-AA2620 , AA2620-AA2650, 
AA2640-AA2660, AA2650-AA2700, AA2655-AA2670 , 
AA2670-AA2700, AA2700-AA2750 , AA2740-AA2760 , 
AA2750-AA2800, AA2755-AA2780, 

AA2780-AA2830, AA2785-AA2810 , AA2796-AA2886 , 
AA2810-AA2825, AA2800-AA2850 , AA2850-AA2900, 
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AA2850-AA2865, AA2885-AA2905, AA2900-AA2950 , 
AA2910-AA2930, AA2925-AA2950, AA2945-end(C ' terminal). 

Still another aspect of the invention is a 
monoclonal antibody directed against an epitope encoded in 
HCV cDNA, wherein the HCV cDNA is of a sequence indicated 
by nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 
17, or is the sequence present in clone 13i, or clone 26 j, 
or clone 59a, or clone 84a, or clone CA156e, or clone 
167b, or clone pil4a, or clone CA216a, or clone CA290a, or 
clone ag30a, or clone 205a, or clone 18g, or clone 16jh. 

Yet another aspect of the invention is a 
preparation of purified polyclonal antibodies directed 
against a polypeptide comprised of an epitope encoded 
within HCV cDNA, wherein the HCV cDNA is of a sequence 
indicated by nucleotide numbers -319 to 1348 or 8659 to 
8866 in Fig. 17, or is the sequence present in in clone 
13i, or clone 26 j, or clone 59a, or clone 84a, or clone 
CA156e, or clone 167b, or clone pil4a, or clone CA216a, or 
clone CA290a, or clone ag30a, or clone 205a, or clone 18g, 
or clone 16 jh. 

Still another aspect of the invention is a 
polynucleotide probe for HCV, wherein the probe is 
comprised of an HCV sequence derived from an HCV cDNA 
sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fig. 17, or from the complement of the HCV 
cDNA sequence. 

Yet another aspect of the invention is a kit for 
analyzing samples for the presence of polynucleotides from 
HCV comprising a polynucleotide probe containing a 
nucleotide sequence of about 8 Or more nucleotides, 
wherein the nucleotide sequence is derived from HCV cDNA 
which is of a sequence indicated by nucleotide numbers - 
319 to 1348 or 8659 to 8866 in Fig. 17, wherein the 
polynucleotide probe is in a suitable container. 
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• An ° ther as P ect °* the invention is a kit £or 
analysxng samples for the presance an *" * 

» a °nTv Sin :- an Mtib0dy " hi ° h ^-^ally with 

n:r ' H rr r ains an 

* nucleotide numhers ^or ^3.^, 

17, or wherein the HCV cDNA is in clone 13i nr i 

» r^r t- or cione 84 - - —"ciis 1 nz 26j - 

167b, or clone pil4a, or clone* P»„f ne 
f or clone CA216a, or clone CA7Qn a 

Cione ag30a or clone 205a , or clone 18g> Qr - ~ 

analog J^TS 2^.2" ^ T " 
is comprising an antigenic J^VlZZnlTZ 
epxtope encoded within acv cDNA which is of T 

or clone p± 14 «, or clone CA2l6a, or clone CA290a 0 / 

ag30a, or clone 205a, or clone l*a « , CA29 ° a ' ° r clone 

, w cione I8g, or clone 16jh. 

analysing T^l ^J^TT^ ^ 

^r: r p cr eptide — T ™ : - 

» clone 01,90a. « ^"^^ « 

c C lo°:e e 3 C ' " M ClOM « "one 3 f " 

clone 33g, or clone 39c. or clone 15e, wherein the 

antxgenic po vpeptide is present in a suitaole container. 
30 ™,h^ « ! another aspect of the invention is a 
method for detecting HCV nucleic acids in a salte 
comprising! sample 

m ,„ , (a) reacti "9 nucleic acids of the sample with a 
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reacting is under conditions which allow the formation of 
a polynucleotide duplex between the probe and the HCV 
nucleic acid from the sample; and (b) detecting a 
polynucleotide duplex which contains the probe, formed in 
step (a) . 

Yet another aspect of the invention is an 
immunoassay for detecting an HCV antigen comprising: 

(a) incubating a sample suspected of containing 
an HCV antigen with an antibody directed against an HCV 
epitope encoded in HCV cDNA, wherein the HCV cDNA is of a 
sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fig. 17, or is the sequence present in 
clone 131, or clone 26 j , or clone 59a, or clone 84a, or 
clone CA156e, or clone 167b, or clone pil4a, or clone 
CA216a, or clone CA290a, or clone ag30a, or clone 205a, or 
clone 18g, or clone 16 jh, and wherein the incubating is 
under conditions which allow formation of an antigen- 
antibody complex; and (b) detecting an antibody-antigen 
complex formed in step (a) which contains the antibody. 

Still another aspect of the invention is an 
immunoassay for detecting antibodies directed against an 
HCV antigen comprising: 
25 < a ) incubating a sample suspected of containing 

anti-HCV antibodies with an antigen polypeptide containing 
an epitope encoded in HCV cDNA, wherein the HCV cDNA is of 
a sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fig. 17, or is the sequence present in 
clone 13i, or clone 26 j, or clone 59a, or clone 84a, or 
clone CA156e, or clone 167b, or clone pil4a, or clone 
CA216a, or clone CA290a, or clone ag30a, or clone 205a, or 
clone 18g, or clone 16jh, and wherein the incubating is 
under conditions which allow formation of an antigen- 
antibody complex; and detecting an antibody-antigen 
complex formed in step (a) which contains the antigen 
polypeptide . 
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Another aspect of the invention is a vaccine for 
treatment of HCV infection comprising an immunogenic 
polypeptide containing an HCV epitope encoded in HCV cDNA 
wherein the HCV cDNA is of a sequence indicated by 
nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig 17 
or xs the sequence present in clone 13i, or clone 26 1, or 
clone 59a, or clone 84a, or clone CA156e, or clone 167b, 

10 aa30a° n or P cr a ' " Cl °~ CA290 *' « <*°- 

ag30a, or clone 205a, or clone 18g, or clone 16jh, and 

wherein the immunogenic polypeptide is present in a 
Pharmacologically effective dose in a pharmaceutic ally 
acceptable excipient. 

15 £or r>r* H ^ an ° ther aSPSCt ° f thS invent ion is a method 
for producxng antibodies to HCV comprising administering 
to an individual an isolated immunogenic polyeptide 

HCrcDNA^ an / CV SPitOPe enC ° ded in HCV CDNA ' Wh6 -- «* 
" 9 SeqU6nCe indicated ^ nucleotide numbers 
2q -319 to 1348 or 8659 to 8866 in Fig. l 7 , or is of the 

sequence present i„ clone CA279a, or clone CA74a, or clone 

131, or clone CA290a, or clone 33C or clone 40b, or clone 

33b, or clone 25c, or clone 14c, or clone 8f , or clone 

33f, or clone 33g, or clone 39c, or clone 15e, and wherein 

(s the immunogenic polypeptide is present in a 

pharmacologically effective dose in a pharmaceutical^ 

acceptable excipient. 

Still another aspect of the invention is an 
antisense polynucleotide derived from HCV cDNA, wherein 
Q the HCV cDNA is that shown in Fig. 17. 

Yet another aspect of the invention is a method 
for preparing purified fusion polypeptide C100-3 compris- 

(a) providing a crude cell lysate containing 
polypeptide C100-3, 
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(b) treating the crude cell lysate with an 
amount of acetone which causes the polypeptide to 
precipitate, 

(c) isolating and solubilizing the precipitated 

material, 

(d) isolating the C100-3 polypeptide by anion 
exchange chromatography, and 

(e) further isolating the C100-3 polypeptide of 
step (d) by gel filtration. 



Brief Description of the Drawings 

Fig. 1 shows the sequence of the HCV cDNA in 
clone 12f, and the amino acids encoded therein. 

Fig. 2 shows the HCV cDNA sequence in clone k9- 
1, and the amino acids encoded therein. 

Fig. 3 shows the sequence of clone 15e, and the 
amino acids encoded therein. 

Fig. 4 shows the nucleotide sequence of HCV cDNA 
in clone 13i, the amino acids encoded therein, and the 
sequences which overlap with clone 12f . 

Fig. 5 shows the nucleotide sequence of HCV cDNA 
in clone 26 j, the amino acids encoded therein, and the 
sequences which overlap clone 13i. 

Fig. 6 shows the nucleotide sequence of HCV cDNA 
in clone CA59a, the amino acids encoded therein, and the 
sequences which overlap with clones 26 j and K9-1. 

Fig. 7 shows the nucleotide sequence of HCV cDNA 
in clone CA84a, the amino acids encoded therein, and the 
sequences which overlap with clone CA59a. 

Fig. 8 shows the nucleotide sequence of HCV cDNA 
in clone CA156e, the amino acids encoded therein, and the 
sequences which overlap with CA84a. 
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in clone cl«; b ' ITIZ -"l*""* «*—- Ot HCV CDNA 

seance, which ^ 'Zxtlt therSin ' - «» 

Fig. 10 shows the nucleotide sequence of H rv 
5 CDMA in clone CA216a, the an.ino acids enco^d l h ll™ and 
the overlap with clone CA167b. serein, and 

Fig. 11 shows the nucleotide sequence of hpv 
c«n in clone CA290a, the a.ino acids encoded therein and 
the overlap with clone CA216a. serein, and 

10 Fi9 * 12 Shows the nucleotide sequence of HCV 

CDNA in clone ag30a and the overlap with Lne c^uT 

cdna • i 13 ShOWS thS nucleoti <** sequence of HCV 

CDNA xn clone CA205a, and the overlap with the HCV cDNA 
sequence in clone CA290a. 

cdna • / ±9 ' " Sh ° WS the nucle °tide sequence of HCV 
CDNA ,n clone l 8 g, and the overlap with the HCV cDNA 
sequence in clone ag30a. 

Fig. 15 shows the nucleotide sequence of hpv 
cdna in clone l 6j h, the a^ino acids encoded therein Ld 

c\ h o e ne° V Ise aP °' nUCle ° tideS ™ °» »™ « 

Fig. 16 shows the ORF of HCV cDNA derived from 
clones pi 14a , CAl S7b . CA15 6 „ 0,84a, CA59a, K S -Tl2 

" «=: i*' £ 8h ; 33 !; 40b - 37b - 3S - 3,, 3 3b , 

5C, Itc, Bt, 33f, 33g, 39c, 3Sf, 19g, 26g, and 15.. 

HCV =n», . 17 ShOWS *"* Sen " e StrMd of the compiled 

HCV CDNA sequence derived from the above-described clones 
and the compiled HCV coma sequence published i„ EP0 Ptt T 

30 derived'areblL. T" * r0m ""^ S " UenC8 ~ 
aerlved are bU4a, 18g, ag30a, CA205a, CA290a, CA21Sa 

i £ »" 7b ; 2 r i !!r CA84a - CAS9a - — «- 

Ho iL « ! ' l4 °' 33£ ' 339 ' 39c ' 35f, 19g, 

26g, ISe, b5 a, and lS jh. l„ the figure the three 

« borstal dashes above the sequence indicate t„e position 
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of the putative initiator methionine codonj the two 
vertical dashes indicate the first and last nucleotides of 
the published sequence. Also shown in the figure is the 
amino acid sequence of the putative polyprotein encoded 
in the HCV cDNA. 

Fig. 18 is a diagram of the immunological colony 
screening method used in antigenic mapping studies. 

Fig. 19 shows the hydrophobicity profiles of 
polyproteins encoded in HCV and in West Nile virus. 

Fig. 20 is a tracing of the hydrophilicity/ 
hydrophobicity profile and of the antigenic index of the 
putative HCV polyprotein. 

15 Fia * 21 shows the conserved co-linear peptides 

in HCV and Flaviviruses . 



10 



Modes for Carr ying Out the Invention 

I • Definitions 

The term "hepatitis C virus" has been reserved 
by workers in the field for an heretofore unknown 
etiologic agent of NANBH. Accordingly, as used herein, 
"hepatitis C virus" (HCV) refers to an agent causitive of 
NANBH, which was formerly referred to as NANBV and/or BB- 
NANBV. The terms HCV, NANBV, and BB-NANBV are used inter- 
changeably herein. As an extension of this terminology, 
the disease caused by HCV, formerly called NANB hepatitis 
(NANBH), is called hepatitis C. The terms NANBH and 
hepatitis C may be used interchangeably herein. 

The term "HCV", as used herein, denotes a viral 
species of which pathogenic strains cause NANBH, and at- 
tenuated strains or defective interfering particles 
derived therefrom. As shown infra., the HCV genome is 
comprised of RNA. It is known that RNA containing viruses 
have relatively high rates of spontaneous mutation, i.e., 
reportedly on the order of 10" 3 to 10~ 4 per incorporated 
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nucleotide (Fields . Knipe (1986,,. Therefore, there are 
Multiple strains, which «,ay be virulent or avirulent 

* TT SPeCieS dMCribed in£ "- ™° options 

and method, described herein, enable the propagation 

\ ? ' dete ° ti0n ' Md Elation of the varies 
" ,tr r ° r Moreover, the disclosure herein 

allows the preparation of diagnostics and vaccines for the 

10 have^T 1 ""' M " Sl1 ^ com P° si "ons and Methods that 
have utilxty » screening procedures for anti-viral agents 
for pharmacologic use, such as agents that inhibit 
replication of HCV. 

The information provided herein, although 
is derived from the prototype strain or isolate of HCV 

hereinafter referred to as CDC/HCV1 (also called HCV1), is 
sufficient to allow a viral taxonomist to identify otner 
strains which fall within the species. The information 
provxded herein allows the belief that HCV is a Flavi-like 
2q virus The morphology and composition of Flavivirus 

partxcles are known, and are discussed in Brinton (1986) 
Generally, with respect to morphology, Flaviviruses 
contain a central nucleocapsid surrounded by a lipid 

aboT^ J" 10 " ^ SPhSriCal ^ haVS * «^~tL of 
25 about 40-50 nm. Their cores are about 25-30 nm in 

trTZT' ^ ^ ° Ut6r SUrfaCS ° f —lope 

In \ I ab ° Ut 5 " 10 - lon * terminal 

knobs about 2 nm in diameter. 

Different strains or isolates of HCV are 
3q expected to contain variations at the amino acid and 

nucleic acids compared with the prototype isolate, HCV1. 

TVI« SXpeCted t0 -howmuch (i.e. more than 

about 40%) homology in the total amino acid sequence 
compared with HCV1. However, it may also be found that 
35 other less homologous HCV isolates. These would be 

defined as HCV strains according to various criteria such 
as an ORF of approximately 9,000 nucleotides to ap- 
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proximately 12,000 nucleotides, encoding a polyprotein 
similar in size to that of HCV1, an encoded polyprotein of 
similar hydrophobic and antigenic character to that of 
HCV1, and the presence of co-linear peptide sequences that 
are conserved with HCVi. m addition, the genome would be 
a positive -stranded RNA. 

HCV encodes at least one epitope which is im- 
munologically identifiable with an epitope in the HCV 
genome from which the cDNAs described herein are derived; 
preferably the epitope is contained an amino acid sequence 
described herein. The epitope is unique to HCV when 
compared to other known Flaviviruses . The uniqueness of 
the epitope may be determined by its immunological re- 
activity with anti-HCV antibodies and lack of im- 
munological reactivity with antibodies to other Flavivirus 
species. Methods for determining immunological reactivity 
are known in the art, for example, by radioimmunoassay, by 
Elisa assay, by hemagglutination, and several examples of 
suitable techniques for assays are provided herein. 

In addition to the above, the following 
parameters of nucleic acid homology and amino acid homol- 
ogy are applicable, either alone or in combination, in 
identifying a strain or isolate as HCV. Since HCV strains 
and isolates are evolutionarily related, it is expected 
that the overall homology of the genomes at the nucleotide 
level probably will be about 40% or greater, probably 
about 60% or greater, and even more probably about 80% or 
greater; and in addition that there will be corresponding 
contiguous sequences of at least about 13 nucleotides. 
The correspondence between the putative HCV strain genomic 
sequence and the CDC /HCVI cDNA sequence can be determined 
by techniques known in the art. For example, they can be 
determined by a direct comparison of the sequence informa- 
tion of the polynucleotide from the putative HCV, and the 
HCV cDNA sequence(s) described herein. For example, also, 
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they can be determined by hybridization of the 
polynucleotides under conditions which form stable 
s duplexes between homologous regions (for example, those 
whxch would be used prior to S, digestion), followed by 
dxgestion with single stranded specific nuclease(s), fol- 
lowed by size determination of the digested fragments. 

Because of the evolutionary relationship of the 
iQ straxns or isolates of HCV, putative HCV strains or 
isolates are identifiable by their homology at the 
polypeptide level. Generally, HCV strains or isolates are 
expected to be more than about 40% homologous, probably 
more than about 70% homologous, and even more probably 
15 more than about 80% homologous, and some may even be more 
than about 90% homologous at the polypeptide level. The 
techniques for determining amino acid sequence homology 
are known in the art. For example, the amino acid 
sequence may be determined directly and compared to the 
2Q sequences provided herein. Alternatively the nucleotide 
sequence of the genomic material of the putative HCV may 
be determined (usually via a cDNA intermediate), the amino 
acxd sequence encoded therein can be determined, and the 
corresponding regions compared. 

25 USed her *in, a polynucleotide "derived from" 

wh±!» T ated SeqUSnCe rSferS t0 S Nucleotide sequence 
whxch is comprxsed of a sequence of approximately at least 

about 6 nucleotides, preferably at least about 8 

nucleotides, more preferably at least about 10-12 

3Q nucleotides, and even more preferably at least about 15-20 
nucleotides corresponding to a region of the designated 
nucleotide sequence. "Corresponding" means homologous to 
or complementary to the designated sequence. Preferably, 
the sequence of the region from which the polynucleotide 

35 ". d ^t Ved " homo ^gous to or complementary to a sequence 
whxch xs unique to an HCV genome. Whether or not a 
sequence is unique to the HCV genome can be determined by 
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techniques known to those of skill in the art. For 
example, the sequence can be compared to sequences in 
databanks, e.g., Genebank, to determine whether it is 
present in the uninfected host or other organisms. The 
sequence can also be compared to the known sequences of 
other viral agents, including those which are known to 
induce hepatitis, e.g., HAV, HBV, and HDV, and to other 
members of the Flaviviridae . The correspondence or non- 
correspondence of the derived sequence to other sequences 
can also be determined by hybridization under the ap- 
propriate stringency conditions. Hybridization techniques 
for determining the complementarity of nucleic acid 
sequences are known in the art, and are discussed infra. 
See also, for example, Maniatis et al . (1982). in addi- 
tion, mismatches of duplex polynucleotides formed by 
hybridization can be determined by known techniques, 
including for example, digestion with a nuclease such as 
SI that specifically digests single- stranded areas in 
duplex polynucleotides. Regions from which typical DNA 
sequences may be "derived" include but are not limited to, 
for example, regions encoding specific epitopes, as well 
as non- transcribed and/or non- translated regions. 

The derived polynucleotide is not necessarily 
physically derived from the nucleotide sequence shown, but 
may be generated in any manner, including for example, 
chemical synthesis or DNA replication or reverse 
transcription or transcription. In addition, combinations 
of regions corresponding to that of the designated 
sequence may be modified in ways known in the art to be 
consistent with an intended use. 

Similarly, a polypeptide or amino acid sequence 
"derived from" a designated nucleic acid sequence refers 
to a polypeptide having an amino acid sequence identical 
to that of a polypeptide encoded in the sequence, or a 
portion thereof wherein the portion consists of at least 
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3-5 amino acids, and more preferably at least 8-10 amino 
acids, and even more preferably at least 11-15 amino 
5 acids, or which is immunologically identifiable with a 
polypeptide encoded in the sequence. 

A recombinant or derived polypeptide is not 
necessarily translated from a designated nucleic acid 
sequence, for example, the HCV cDNA sequences described 
iQ herein, or from an HCV genome; it may be generated in any 
manner, including for example, chemical synthesis, or 
expression of a recombinant expression system, or isola- 
tion from mutated HCV. A recombinant or derived 
polypeptide may include one or more analogs of amino acids 
is or unnatural amino acids in its sequence. Methods of 

inserting analogs of amino acids into a sequence are known 
xn the art. it also may include one or more labels, which 
are known to those of skill in the art. 

The term "recombinant polynucleotide" as used 
2o herein intends a polynucleotide of genomic, cDNA, 

semisynthetic, or synthetic origin which, by virtue of its 
origin or manipulation which: (1, ±. not associated with 
all or a portion of a polynucleotide with which it is as- 
sociated in nature, (2) is ii nked to a polynucleotide 
2s other than that to which it is linked in nature, or (3) 
does not occur in nature. 

The term "polynucleotide" as used herein refers 
to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides . This term refers 
3Q only to the primary structure of the molecule. Thus, this 
term includes double- and single-stranded DNA, as well as 
double- and single stranded RNA. It also includes known 
types of modifications, for example, labels which are 
known in the art, methylation, "caps", substitution of one 
3s or more of the naturally occurring nucleotides with an 
analog, internucleotide modifications such as, for 
example, those with uncharged linkages (e.g., me thyl 
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phosphonates , phosphotriesters , phosphoamidates , 
carbamates, etc.) and with charged linkages (e.g., 
phosphorothioates , phosphorodithioates , etc.), those 
containing pendant moieties, such as, for example proteins 
(including for e.g., nucleases, toxins, antibodies, signal 
peptides, poly-L- lysine, etc.), those with intercalators 
(e.g., acridine, psoralen, etc.), those containing 
chelators (e.g., metals, radioactive metals, boron, oxida- 
tive metals , etc . ) , those containing alkylators , those 
with modified linkages (e.g., alpha anomeric nucleic 
acids, etc.), as well as unmodified forms of the 
polynucleotide . 

The term "purified viral polynucleotide" refers 
to an HCV genome or fragment thereof which is essentially 
free, i.e., contains less than about 50%, preferably less 
than about 70%, and even more preferably less than about 
90% of polypeptides with which the viral polynucleotide is 
naturally associated. Techniques for purifying viral 
polynucleotides from viral particles are known in the art, 
and include for example, disruption of the particle with a 
chaotropic agent, differential extraction and separation 
of the polynucleotide^ ) and polypeptides by ion-exchange 
chromatography, affinity chromatography, and sedimentation 
according to density. 

The term "purified viral polypeptide" refers to 
an HCV polypeptide or fragment thereof which is es- 
sentially free, i.e., contains less than about 50%, 
preferably less than about 70%, and even more preferably 
less than about 90%, of cellular components with which the 
viral polypeptide is naturally associated. Techniques for 
purifying viral polypeptides are known in the art, and 
examples of these techniques are discussed infra. The term 
"purified viral polynucleotide" refers to an HCV genome or 
fragment thereof which is essentially free, i.e., contains 
less than about 20%, preferably less than about 50%, and 
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even more preferably less than about 70% of polypeptides 
with which the viral polynucleotide is naturally associ- 
5 ated. Techniques for purifying viral polynucleotides from 
viral particles are known in the art, and include for 
example, disruption of the particle with a chaotropic 
agent, and separation of the polynucleotide ( s ) and 
polypeptides by ion-exchange chromatography, affinity 
io chromatography, and sedimentation according to density. 

"Recombinant host cells", "host cells", "cells" 
"cell lines", "cell cultures", and other such terms denot^ 
xng microorganisms or higher eukaryotic cell lines 
cultured as unicellular entities refer to cells which can 
is be, or have been, used as recipients for recombinant vec- 
tor or other transfer DNA, and include the progeny of the 
original cell which has been trans fected. it is 
understood that the progeny of a single parental cell may 
not necessarily be completely identical in morphology or 
2Q xn genomic or total DNA complement as the original parent, 
due to natural, accidental, or deliberate mutation. 

A "replicon" is any genetic element, e.g., a 
plasmid, a chromosome, a virus, a cosmid, etc. that 
behaves as an autonomous unit of polynucleotide replica- 
2s tion within a cell; i.e., capable of replication under its 
own control. 

A "vector" is a replicon in which another 
polynucleotide segment is attached, so as to bring about 
the replication and/or expression of the attached segment. 

30 "Control sequence" refers to polynucleotide 

sequences which are necessary to effect the expression of 
coding sequences to which they are ligated. The nature of 
such control sequences differs depending upon the host 
organism; in prokaryotes, such control sequences generally 

35 xnclude promoter, ribosomal binding site, and terminators- 
xn eukaryotes, generally, such control sequences include 
promoters, terminators and, in some instances, enhancers 
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The term "control sequences" is intended to include, at a 
minimum, all components whose presence is necessary for 
expression, and may also include additional components 
whose presence is advantageous, for example, leader 
sequences . 

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a relationship 
permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence is 
ligated in such a way that expression of the coding 
sequence is achieved under conditions compatible with the 
control sequences . 

An "open reading frame" (ORF) is a region of a 
polynucleotide sequence which encodes a polypeptide; this 
region may represent a portion of a coding sequence or a 
total coding sequence . 

A "coding sequence" is a polynucleotide sequence 
which is transcribed into mRNA and/or translated into a 
polypeptide when placed under the control of appropriate 
regulatory sequences. The boundaries of the coding 
sequence are determined by a translation start codon at 
the 5 '-terminus and a translation stop codon at the 3'- 
terminus. A coding sequence can include, but is not 
limited to mRNA, cDNA, and recombinant polynucleotide 
sequences . 

"Immunologically identifiable with/as" refers to 
the presence of epitope(s) and polypeptides ( s ) which are 
also present in the designated polypeptide ( s ) , usually HCV 
proteins. Immunological identity may be determined by 
antibody binding and/or competition in binding; these 
techniques are known to those of average skill in the art, 
and are also illustrated infra. 

As used herein, "epitope" refers to an antigenic 
determinant of a polypeptide; an epitope could comprise 3 
amino acids in a spatial conformation which is unique to 
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the epitope, generally an epitope consists of at least 5 
such amino acids, and more usually, consists of at least 
8-10 such amino acids. Methods of determining the spatial 
conformation of amino acids are known in the art, and 
xnclude, for example, x-ray crystallography and 2- 
dimensional nuclear magnetic resonance. 

A polypeptide is "immunologically reactive" with 
an antibody when it binds to an antibody due to antibody 
recognition of a specific epitope contained within the 
polypeptide. Immunological reactivity may be determined 
by antibody binding, more particularly by the kinetics of 
antibody binding, and/or by competition in binding using 
as competitor (s) a known polypeptide ( s ) containing an 
epitope against which the antibody is directed. The 
techniques for determining whether a polypeptide is im- 
munologically reactive with an antibody are known in the 



20 AS USSd herei "' the term "immunogenic 

polypeptide" is a polypeptide that elicits a cellular and/ 
or humoral response, whether alone or linked to a carrier 
xn the presence or absence of an adjuvant. 

The term "polypeptide" refers to a polymer of 
25 amino acids and does not refer to a specific length of the 
product; thus, peptides, oligopeptides, and proteins are 
included within the definition of polypeptide. This term 
also does not refer to or exclude post-expression 
modifications of the polypeptide, for example, 
3Q glycosylations, acetylations, phosphorylations and the 
like. included within the definition are, for example, 
polypeptides containing one or more analogs of an amino 
acid (including, for example, unnatural amino acids, 
etc.), polypeptides with substituted linkages, as well as 
3s other modifications known in the art, both naturally oc- 
curring and non-naturally occurring. 
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"Transformation", as used herein, refers to the 
insertion of an exogenous polynucleotide into a host cell, 
irrespective of the method used for the insertion, for 
example, direct uptake, transduction, f -mating or 
electroporation. The exogenous polynucleotide may be 
maintained as a non- integrated vector, for example, a 
plasmid, or alternatively, may be integrated into the host 
genome . 

"Treatment" as used herein refers to prophylaxis 
and/ or therapy. 

An "individual", as used herein, refers to 
vertebrates, particularly members of the mammalian spe- 
cies, and includes but is not limited to domestic animals, 
sports animals, and primates, including humans. 

As used herein, the "sense strand" of a nucleic 
acid contains the sequence that has sequence homology to 
that of mRNA. The "anti-sense strand" contains a sequence 
which is complementary to that of the "sense strand". 

As used herein, a "positive stranded genome" of 
a virus is one in which the genome, whether RNA or DNA, is 
single-stranded and which encodes a viral polypeptide (s ) . 
Examples of positive stranded RNA viruses include 
Togaviridae, Coronaviridae, Retroviridae , Picornaviridae, 
and Caliciviridae. Included also, are the Flaviviridae , 
which were formerly classified as Togaviradae. See Fields 
& Knipe (1986) . 

As used herein, "antibody-containing body 
component" refers to a component of an individual's body 
which is a source of the antibodies of interest. Antibody 
containing body components are known in the art, and 
include but are not limited to, for example, plasma, 
serum, spinal fluid, lymph fluid, the external sections of 
the respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, white blood cells, and myelomas. 
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As used herein, "purified HCV" refers to a 
preparation of HCV which has been isolated from the cel- 
lular constituents with which the virus is normally as- 
sociated, and from other types of viruses which may be 
present in the infected tissue. The techniques for 
isolating viruses are known to those of skill in the art, 
and include, for example, centrifugation and affinity 
chromatography; a method of preparing purified HCV is 
discussed infra. 

The term "HCV particles" as used herein include 
entire virion as well as particles which are intermediates 
xn virion formation. HCV particles generally have one or 
more HCV proteins associated with the HCV nucleic acid. 

As used herein, the term "probe" refers to a 
polynucleotide which forms a hybrid structure with a 
sequence in a target region, due to complementarity of at 
least one sequence in the probe with a sequence in the 
target region. The probe, however, does not contain a 
sequence complementary to sequence(s) used to prime the 
polymerase chain reaction. 

As used herein, the term "target region" refers 
to a region of the nucleic acid which is to be amplified 
and/or detected. 

As used herein, the term "viral RNA", which 
includes HCV RNA, refers to RNA from the viral genome, 
fragments thereof, transcripts thereof, and mutant 
sequences derived therefrom. 

As used herein, a "biological sample" refers to 
a sample of tissue or fluid isolated from an individual, 
including but not limited to, for example, plasma, serum, 
spinal fluid, lymph fluid, the external sections of the 
skin, respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, blood cells, tumors, organs, and also 
samples of in vitro cell culture constituents (including 
but not limited to conditioned medium resulting from the 
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growth of cells in cell culture medium, putatively virally 
infected cells, recombinant cells, and cell components). 

II • Description of the Invention 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, recombinant 
DNA, and immunology, which are within the skill of the 
art. Such techniques are explained fully in the 
literature. See e.g., Maniatis, Fitsch & Sambrook, 
MOLECULAR CLONING; A LABORATORY MANUAL (1982); DNA CLON- 
ING, VOLUMES I AND II (D.N Glover ed. 1985); 
OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); NUCLEIC 
ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins eds . 1984); 
TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. Higgins' 
eds. 1984); ANIMAL CELL CULTURE (R.I. Freshney ed. 1986); 
IMMOBILIZED CELLS AND ENZYMES (IRL Press, 1986); B. 
Perbal, A PRACTICAL GUIDE TO MOLECULAR CLONING (1984); 
the series, METHODS IN ENZYMOLOGY (Academic Press, Inc.); 
GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (J.H. Miller and 
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory), 
Methods in Enzymology Vol. 154 and Vol. 155 (Wu and 
Grossman, and Wu, eds., respectively), Mayer and Walker, 
eds. (1987), IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR 
BIOLOGY (Academic Press, London), Scopes, (1987), PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition 
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL IM- 
MUNOLOGY, VOLUMES i-iv (D.M. Weir and C. C. Blackwell eds 
1986). All patents, patent applications, and publications 
mentioned herein, both supra and infra, are hereby in- 
corporated herein by reference. 

The useful materials and processes of the 
present invention are made possible by the provision of a 
family of nucleotide sequences isolated from cDNA librar- 
ies which contain HCV cDNA sequences. These cDNA librar- 



WO 90/11089 -30- _„ 

PCT/US90/01348 



xes were derived from nucleic acid sequences present in 
the plasma of an HCV-infected chimpanzee. The construe 

5 ln°Z°* ° f thSSe libraries ' the -c- library (ATCC No. 

40394), was reported in EPO Pub. No. 318,216. Several of 
the clones containing HCV cDNA reported herein were 
obtained from the - c - library. Although other clones 
reported herein were obtained from other HCV cDNA librar- 
io xes, the presence of clones containing the sequences in 
the »c- library was confirmed. As discussed in EPO Pub 
No. 318,216, the family of HCV cDNA sequences isolated 
from the »c« library are not of human or chimpanzee 
origin, and show no significant homology to sequences 
is contained within the HBV genome. 

The availability of the HCV cDNAs described 
herein permits the construction of polynucleotide probes 
which are reagents useful for detecting viral 
polynucleotides in biological samples, including donated 
2q blood. For example, from the sequences it is possible to 
synthesize DNA oligomers of about 8-10 nucleotides, or 
larger, which are useful as hybridization probes to detect 
the presence of HCV RNA in, for example, donated blood, 
sera of subjects suspected of harboring the virus, or cell' 
^ culture systems in which the virus is replicating. In 
addition, the cDNA sequences also allow the design and 
production of HCV specific polypeptides which are useful 
as diagnostic reagents for the presence of antibodies 
raised during HCV infection. Antibodies to purified 
3Q Polypeptides derived from the cDNAs may also be used to 
detect viral antigens in biological samples, including, 
for example, donated blood samples, sera from patients 
with NANBH, and in tissue culture systems being used for 
HCV replication. Moreover, the immunogenic polypeptides 
3s disclosed herein, which are encoded in portions of the ORF 
of HCV cDNA shown in Fig. 17, are also useful for HCV 
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screening, diagnosis, and treatment, and for raising anti- 
bodies which are also useful for these purposes. 

In addition, the novel cDNA sequences described 
herein enable further characterization of the HCV genome. 
Polynucleotide probes and primers derived from these 
sequences may be used to amplify sequences present in cDNA 
libraries, and/or to screen cDNA libraries for additional 
overlapping cDNA sequences, which, in turn, may be used to 
obtain more overlapping sequences. As indicated infra, 
and in EPO Pub. No. 318,216, the genome of HCV appears to 
be RNA comprised primarily of a large open reading frame 
(ORF) which encodes a large polyprotein. 

The HCV cDNA sequences provided herein, the 
polypeptides derived from these sequences, and the im- 
munogenic polypeptides described herein, as well as anti- 
bodies directed against these polypeptides are also useful 
in the isolation and identification of the blood-borne 
NABV (BB-NANBV) agent(s). For example, antibodies 
directed against HCV epitopes contained in polypeptides 
derived from the cDNAs may be used in processes based upon 
affinity chromatography to isolate the virus. 
Alternatively, the antibodies may be used to identify 
viral particles isolated by other techniques. The viral 
antigens and the genomic material within the isolated 
viral particles may then be further characterized. 

In addition to the above, the information 
provided infra allows the identification of additional HCV 
strains or isolates. The isolation and characterization 
of the additional HCV strains or isolates may be ac- 
complished by isolating the nucleic acids from body 
components which contain viral particles and/or viral RNA, 
creating cDNA libraries using polynucleotide probes based 
on the HCV cDNA probes described infra . , screening the 
libraries for clones containing HCV cDNA sequences 
described infra., and comparing the HCV cDNAs from the new 
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"r b ?ot 8 - other methods f ~ "-'^'h" 8 "ai; y 
» - ^r- zzz hit in tha - 

isolation of g ey cDNA ggg ug 

extenri , K ThS n ° Vel HCV cDN * sequences described infra 
extend the sequence of the cDNA to the HCV «.„ 
15 in EPO Pub. No 318 mu genome reported 

nucleotides „os. -319 to !348 of the composite HCV cdn* 
sequence. (The negative number on e nucleotide "'Z.^ 

llllT, Ma Md 163h lis Oownstreem of the 

25 Of IT SSqUenCe ' ^ yieW "-^«ides nos. 8659 to 8866 
of the composite sequence. The composite HCV cDNA 
sequence which includes the sequences in tufafc- 
mentioned clones, is shown in Fig. 17 

isolated n ° Vel CDNAS herein were 

30 ° f HCV CDH& Eluding 

The HCV t™17J in lambda 91:11 < ATCC *>• 40394) 
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tion of cDNA libraries to the viral genome. The 
procedures for isolation of putative HCV particles and for 
constructing the "c" HCV cDNA library is described in EPO 
Pub. No. 318,216. other methods for constructing HCV cDNA 
libraries are known in the art, and some of these methods 
are described infra., in the Examples. Isolation of the 
sequences was by screening the libraries using synthetic 
polynucleotide probes, the sequences of which were derived 
from the 5 ' -region and the 3 ' -region of the known HCV cDNA 
sequence. The description of the method to retrive the 
cDNa sequences is mostly of historical interest. The 
resultant sequences (and their complements) are provided 
herein, and the sequences, or any portion thereof, could 
be prepared using synthetic methods, or by a combination 
of synthetic methods with retrieval of partial sequences 
using methods similar to those described herein. 

Prepara tion of Viral Polypeptides and Fragments 

The availability of HCV cDNA sequences, or 
nucleotide sequences derived therefrom (including segments 
and modifications of the sequence), permits the construc- 
tion of expression vectors encoding antigenically active 
regions of the polypeptide encoded in either strand. 
These antigenically active regions may be derived from 
coat or envelope antigens or from core antigens, or from 
antigens which are non-structural including, for example, 
polynucleotide binding proteins, polynucleotide 
polymerase(s) , and other viral proteins required for the 
replication and/or assembly of the virus particle. Frag- 
ments encoding the desired polypeptides are derived from 
the cDNA clones using conventional restriction digestion 
or by synthetic methods, and are ligated into vectors 
which may, for example, contain portions of fusion 
sequences such as beta-galactosidase or superoxide 
dismutase (SOD), preferably SOD. Methods and vectors 
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October 1, 1986. Vectors for the expression of fusion 
polypeptides of SOD and HCV polypeptides encoded ZT 
number of HCV clones ere described infra., in the 
Examples. Any desired portion of the HCV cdna contain!™ 

" o a bta P8 Y ea<iing £rMe ' in Sither Sen - *«— . can be 
obtained as a recombinant polypeptide such „« T 

the CDMA can be provided by chemical synthesis. 

The DKA encoding the desired polypeptide 

« I 11 "/" fU3ed » *™. — "bethel or 

contarnrng a signal sequence to permit secretion, may be 
irgated into expression vectors suitable for any 
convenient host. Both euxaryotic and prokaryotL host 

poC: t a d re presentiy usea in -^an 1 : host 

20 conSof J S "~ ° £ SO " e ° f the — «mmo„ 

control systems and host cell lines is given infra The 

polypeptide is then isolated from lysed celL of L « 
cult ium and purified to the extent needed foTits 

the at PUr "" atio « ^ *° by techniques ta J n 

25 fract f! r MamPle ' di «~*""al extraction, salt 

fractionation, chromatography on ion exchange resins af 
fmity chromatography, centrifugation, and L Hke see 
method"^ S ' Meth ° dS ^ En *™* «« a variety of ' ' 

used as' SI PUr ," ying Pr ° telnS - SUCh peptides can be 
30 used as diagnostics, or those which give rise to neutral! 

bodie antlb ° dieS ^ bS £ °™^ated into vaccines. Anti- 
bodies raised against these polypeptides can also bf used 
as diagnostics, or for passive Immunotherapy. In add"! 
tion, as discussed infra., antibodies to these 
polypeptides are useful for isolat.-n^ . . ^ 
35 particles. "olatxng and identifying HCV 
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Preparation of An tigenic Polypeptides and Conjugation with 
Carrier 

An antigenic region of a polypeptide is gener- 
ally relatively small— typically 8 to 10 amino acids or 
less in length. Fragments of as few as 5 amino acids may 
characterize an antigenic region. These segments may cor- 
respond to regions of HCV antigen. Accordingly, using the 
cDNAs of HCV as a basis, DNAs encoding short segments of 
HCV polypeptides can be expressed recombinantly either as 
fusion proteins, or as isolated polypeptides. in addi- 
tion, short amino acid sequences can be conveniently 
obtained by chemical synthesis. In instances wherein the 
synthesized polypeptide is correctly configured so as to 
provide the correct epitope, but is too small to be im- 
munogenic, the polypeptide may be linked to a suitable 
carrier . 

A number of techniques for obtaining such link- 
age are known in the art, including the formation of 
disulfide linkages using N-succinimidyl-3- ( 2-pyridyl- 
thio) propionate (SPDP) and succinimidyl 4- (N-maleimido- 
methyl)cyclohexane-l-carboxylate (SMCC) obtained from 
Pierce Company, Rockford, Illinois, (if the peptide lacks 
a sulfhydryl group, this can be provided by addition of a 
cysteine residue.) These reagents create a disulfide 
linkage between themselves and peptide cysteine residues 
on one protein and an amide linkage through the epsilon- 
amino on a lysine, or other free amino group in the other. 
A variety of such disul fide /amide -forming agents are 
known. See, for example, Immun. Rev. (1982) 62:185. 
Other bifunctional coupling agents form a thioether rather 
than a disulfide linkage. Many of these thio-ether- 
forming agents are commercially available and include re- 
active esters of 6-maleimidocaproic acid, 2-bromoacetic 
acid, 2-iodoacetic acid, 4-(N-maleimido- 

methyl)cyclohexane-l-carboxylic acid, and the like. The 
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rotavirus/' •binding peptide- svst.™ h ^ploys the 

*• W,l«, f the discos! oT^chT ^ PUt - 

herein by reference. The tjl^^^^ 
bs exhaustive, and modifications of th» 7 t0 
10 can ciearly be used. * naned ""Pounds 

induce theZd™ oTa^ibT T ^ 
Suitable carriers are £££ "\ t0 the — 
macromolecules such as L. ° ' °" ly "Obelized 

iulose beads and the lik.. „„, garose ' cellulose, cel- 
polyglutamic acid polyline ^ SUCh " 

copolymers; and inaSv^ ^? — ~" 

fui protein substrates are seru! !^b Es * ecia "X »«- 

hemocyanin, i,m„uno,lobulI„ ^Z^TT' keyh ° la X ^ 
ovalbumin, tetanus toxoid ~d oth thyr °» lob «"n, 
to those skilled in the art P^ins well known 

Polypeptide^ tZtT^l^^ 
encoding at least one virai en^ aCid ss 9»ono« 

munological reagents. Po" examplT "T MefUl 
ing such truncated seguences Z bl 

immunoass.y. These Jly£ptid« a, " re * 9entS ln an 

subunit antigens in ZJ£££Z foTaT """"^ 
or vaccines, while these 9 ™ Production 

produced by various known treats oT^" * 
Protein, it is generally pre ferae to Ll 
recombinant polypeptides ™" c « 

Polypeptides comprising these truncaced hcv sT""' 
35 be made up entirely of HCV sequent Tone Z Can 
eDitoDPc **<+.u ^ uences (°ne or more 
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Useful heterologous sequences include sequences that 
provide for secretion front a recombinant host, enhance the 
immunological reactivity of the HCV epitope (s , or *** 
facilitate the coupling of the polypeptide to an 
immunoassay support or a vaccine carrier. See, e g EP0 
Pub No. U6,201; U.S. Pat. Ho. 4,722,840; EPO Pub Z 
259,149; U.S. Pat. No. 4,629,783, the disclosures of 
io which are incorporated herein by reference. 

The si ze of polypeptides comprising the 

truncated HCV sequences can vary widelv tho \ • • 
hoin „ „ „ vary wiaeiy, the minimum size 

bexng a sequence of sufficient size to provide a „ HC v 
epitope, whiie the maximum size is not critical. For 
convenience, the maximum size usually is not substantially 
greater than that reguired to provide the desired HCV 
epitopes and functions, of the heterologous sequence if 

wiTi ra^e 1C f lly ' T ^ «" *~ 

wxll range from about 5 to about 100 amino acids in 

20 T tyPiCa " y ' hOWev «' ^e HCV sequence will be 

a maxxmum of about 50 amino acids in length, preferably a 
maxxmum of about 30 amino acids, it is usually desirable 
to select HCV sequences of at least about 10, 12 or 15 
amxno acids, up to a maximum of about 20 or 25 amino 

25 aCiC * s • 

Tru "« ta -1 HCV amino acid sequences comprising 
epitopes can be identified in a number of ways . For 
example, the entire viral protein sequence can be screened 
by preparxng a series of short peptides that together span 
30 the entxre protein sequence. An example of antigenic 

infrr in dd th :. re9i ° nS ° f ^ HCV *°**P«*-t» *• 

-ddxtxon, by starting with, for example, lOOmer 

polypeptides, it would be routine to test each polypeptide 

ftv a e d P :r en ° e °* epit °^< S ' a desired reactiv 

35 xty, and then testing progressively smaller and overlap- 
pxng fragments from an identified lOOmer to map the 
epxtope of interest. Screening such peptides in an 
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inununoassay is within the skill of t ho 

known to carry out * „ art * Xt is also 

rry out a computer analvsis nf = 
_ sequence to identifv now- anal Y 33 - s of a protein 

screening Such . computer £££2 IT°£ £ 
sequence is shown in Fig 20 whl Ju Mlln ° acid 

hydrophobic character Is* , ^ ^P^ 1 ^/ 

in index. The amino acids are ^ * nti9en 

10 (Position l, as sho^n Ih plq^lt^ * « 
those of skill in the art that such ! " aPPreCiated h Y 

comprised o£ clone™r 81 c«r e r i0n VeCt °" 
C33b, CA290a, C8 f, Cl 2 f, ' 

HCV amino acid sequences th— v examples of 

herein are sat [ b^W "^Ll " ^'"^ 

these peptides do not necess Jilv ""darstood that 

epttopa, and may also oontal^rv Pre ° iSSly ™ a P <— 
„ immunoganio. Thasa LTT sequence that Is not 

25 nese n °n-xmmunogenic oortinnc a.^ 

sequence ca n ba dafinad as described !f * 
conventional techniques ^JllllZ^VT' 
sequences. Further , additional ^^d H^v ^ 
sequences that comprise an epitopTor are 
30 be identified as described above The Z^™ 9 ^ °" 
sre given by amino acid number (i.. .f^T "» se *«" =es 
the ammo aci d number as shown ^ J£ » » is 

J5 AA40-AA90; AA45^AA65 ■^IsS^f j AASO-AAlOO; °' 

AA95-AAU0; JUUoSvj& J0 ^j£ft 0 S » **80-90,- AAS9-AA120- 
AA155-AA170; « l9 0-^l°6,^%^„ ; ^0-h«oo^ 
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^■245-AA265; AA250-AA300 ; AA290-AA330- AA290 
AA300-AA350; AA310-AA330 AA350-AA400- AA380 
AA405-AA495; AA400-AA450 AA405-AA415- AA415 
AA425-AA435; AA437-AA582 AA450-AA50o' AA440 
AA460-AA470; AA475-AA495 AA500-AA550- AA511 
AA515-AA550; AA550-AA600 AA550-AA625- AA575 
AA585-AA600; AA600-AA650 AA600-AA625 AA635 
AA650-AA700; AA645-AA680 AA700-AA750- AA700 
AA700-AA750; AA725-AA775 AA770-AA790; JjSSa 
AA800-AA815; AA825-AA850; AA850-AA875 AA800- 
AA920-AA990; AA850-AA900; AA920-AA945 AA940- 
So^JS^^ 950 ^ 1000 ' AA1000VUi6 6 St 



35 



AA1000-AA1025; AA1000-AA1050 
AA1040-AA1055; AA1075-AA1175 
AA1070-AA1100; AA1100-AA1130 
AAJ192-AA1457; AAl 195-AA1250 
AA1225-AA1250; AA1250-AA1300 
AA1260-AA1280; AA1266-AA1428 
AA1290-AA1310; AA1310-AA1340 
AA1345-AA1365; AA1350-AA1400 
AA1380-AA1405; AA1400-AA1450 
AA1460-AA1475; AA1475-AA1515 
AA1500-AA1550; AA1500-AA1515 
AA1550-AA1600; AA1545-AA1560 
AA1570-AA1590; AA1595-AA1610 
AA1610-AA1645; AA1650-AA1690 
AA1689-AA1805; AA1690-AA1720 
AA1720-AA1745; AA1745-AA1770 
AA1775-AA1810; AA1795-AA1850 
AA1900-AA1950; AA1900-AA1920 
AA1920-AA194 0; AA1949-AA2124 
AA1950-AA1985; AA19 80-AA2000 
AA2005-AA2025; AA2020-AA2045 
AA2045-AA2070; AA2054-AA2223 
AA2100-AA2150; AA2150-AA2200 
AA2200-AA2325; AA2250-AA2330 
AA2265-AA2280; AA2280-AA2290 
AA2300-AA2350; AA2290-AA2310 
AA2330-AA2350; AA2350-AA2400 
AA2345-AA2415; AA2345-AA2375 
AA2371-AA2502; AA2400-AA2450 
AA2415-AA2450; AA2445-AA2500 
AA2470-AA2490; AA2500-AA2550 
AA2535-AA2560; AA2550-AA2600 
AA2600-AA2650; AA2605-AA2620 
AA2640-AA2660; AA2650-AA2700 
AA2670-AA2700; AA2700-AA2750 
AA2750-AA2800; AA2755-AA2780 
AA2780-AA2830; AA2785-AA2810 
AA2810-AA2825; AA2800-AA2850 
AA2850-AA2865; AA2885-AA2905 



-305; 

-AA395; 

-AA425; 

-AA460; 

-AA690; 

-AA605; 

-AA665; 

-AA725; 

-AA800; 

-AA850; 

-AA965; 



AA1025-AA1040 
AA1050-AA1200 
AA1140-AA1165 
AA1200-AA1225 
AA1260-AA1310 
AA1300-AA1350 
AA1345-AA1405 
AA1365-AA1380 
AA1450-AA1500 
AA1475-AA1500 
AA1515-AA1550 
AA1569-AA1931 
AA1590-AA1650 
AA1685-AA1770 
AA1694-AA1735 
AA1750-AA1800 
AA1850-AA1900 
AA1916-AA2021 
AA1950-AA2000 
AA2000-AA2050 
AA2045-AA2100 
AA2070-AA2100 
AA2200-AA2250 
AA2255-AA2270 
AA2287-AA2385 
AA2310-AA2330 
AA2348-AA2464 
AA2370-AA2410 
AA2400-AA2425 
AA2445-AA2475 
AA2505-AA2540 
AA2560-AA2580 
AA2620-AA2650 
AA2655-AA2670 
AA2740-AA2760 

AA2796-AA2886; 
AA2850-AA2900; 
AA2900-AA2950; 
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AA2925- AA29 50, A A2 9 45-e„ d(c . 

The above HCV amino acid sequences can be prepared as 
discrete peptides or incorporated into a iZZ" 
polypeptide, and may find use as described herein Ad 
dxtronal polypeptides comprising truncated HCV 1 
are described in the examples. sequences 

10 , The ° bse "ed relationship of the putativ* 

polyproteins of HCV and the Plaviviruses alw some 
Prediction of the putative domains of the HcT'noT 
structural- <HS, proteins. The locations o^the 
-dividual NS proteins in the putative Flavins pre 
cursor polyprotein are fairly well-known „ P 
also coincide with observed ^ . ^Z^Z'lL^" 
hydrophobicity profile of the polyprotein it is 
established that HS5 of Flavivirusls encodes the virion 
polymerase, and that NSl corresponds with a cZ, , 
20 fixation antigen which has been'shown * be Tef^t' 
vaccine in animals. Recentlv <* i, >. effective 
flaviviral protease funl^on'reside iXVT " 
observed similarities betwen HCV and the"' 1^1^.^ 
o e t r io b „ e s d 0 f f t r he'c dedUCtl0nS C ""* the 

25 tions Tthf HCV "ivT! 9 Pr0tei " d0n,ai " S f »~ 
tne HCV polyprotein are possible = 

°f Polypeptides containing these doma^ l n a ^tTof 

yeast i"" d CBllS ' lnClUdln " *» 

yeast, insect, and vertebrate cells, should oive ri== , 

regxons which are towards the N-terminus r n ^ rUCtUral 
there is a oreater di™^ . teriUlnus * In «»xs region, 
a greater divergence in sequence, and in addi- 
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tion the hydrophobic profile of the two regions show less 

rZo~t?' h ThiS " di — e " ^ the H-terminal 

5 HI W ° f **• PUtative NS1 dom ^ ^ HCV, and extends to 
the pressed N-terminus. Nevertheless, it may still be 
possible to predict the approximate locations of the puta- 
tive nucleocapsid (N-terminal basic domain) and E (gener- 
ally hydrophobic) domains within the HCV polyproteil in 
iQ the Examples the predictions are based on the changes 
observed m the hydrophobic profile of the HCV 
polyprotein, and on a knowledge of the location and 
character of the flaviviral proteins. From these predic- 

oTlLZTLT P T ible ^ ±dentify regions 
is of the HCV polyprotexn that could correspond with useful 

xmmunological reagents. For example, the E and NS1 

proteins of Flaviviruses are known to have efficacy as 

protective vaccines. These regions, as well as some which 

are shown to be antigenic in the HCV isolate described 

2q herexn, for example those within putative NS3 , C, and NS5 
etc should also provide diagnostic reagents. Moreover,' 

Q ti0 " ^ ******* ion ° f viral-encoded enzymes may 
also allow the evaluation of anti-viral enzyme inhibitors, 
x.e for example, inhibitors which prevent enzyme activ- 

25 xty by vxrtue of an interaction with the enzyme itself or 
substances which may prevent expression of the enzyme, 
(for example, anti-sense RNA, or other drugs which 
xnterfere with expression) . 
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Preparation of Hybrid Particle Immunoo^ Containing gCV 
Epitopes a ■ 

The immunogenicity of the epitopes of HCV may 
also be enhanced by preparing them in mammalian or yeast 
systems fused with or assembled with particle-forming 
proteins such as, for example, that associated with 
hepatxtis B surface antigen. Constructs wherein the NANBV 
epitope xs linked directly to the particle- forming prote!! 
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coding sequences produce hybrids whinh 
1th respect to the HCV epitope i„ 

vectors prepared include e^tol additio »' of the 

5 various degrees of iJ^SS ^ *° ^ ^ 

the pre-S peptide. Thus ^ SUCh * S ' for -"Pi., 

Particle forcing P^\J^JT^ 
— unogenic with respect to HCV and HBV are 
10 Hepatitis surface antioen fm.c a „ ( 

to be formed and assembled into plrtlcTes ^ ^ 
(Valenzuela et al. (1982)) P artlc ^ xn s. cerevisiag 

mammalian cells (Valenzuela p Z *" ^ exam P le ' 

Nation of such P artT les h as 'b (1984,) - 

, _ immunogenicity of th« . e " Sh ° Wn t0 e nhance the 

15 , y llJ - GX ^Y ot the monomer subunit Th a „ 

also include the immunodominant ep £ pe ^f!""™"* »<* 
ing the 5S amino acids of p "ope °* HBSAg, compris- 

Neurath et al. (19 ^ °* ~*« (Pre-S, region, 

particle expressing In yeast^e ^ ^"S^ 

20 "4,444, pubUshed ^T^.^r EP ° 
heterologous viral sequences for *«"l«ttn, 
disclosed in EP0 175 «? \, ye<,St BX P r9 «i°n are 
constructs -/also L e ' P e U ! s ll f ^ 26 ' These 

« Chinese haLter ova^O cell" ^ SUC " 

2S dih y dro f ol a te reductase™ SSJH:^^,, 

ing an HCV epitope l„ .... ^Placed „ lth codons encod- 

are not reguled^o J^^~" 
30 to for. immunogenic par tic l e L HZTltZ °' ^ U " ltS 
deleted, thus eliminatino add.^ f mammals can be 

f - -Petition „ith t„e' HcTep^pe^ 

Preparation »-f yaccin, 

35 oogenic ZrlZ^Z T " — ^ 

the cDNA seguences descrihedln^ Z^Z'. 
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r^o™«- h0m ° 109y bStWeen HCV and »»^™« Provides 

(1986, The fiavivirus genomic rna is believed to be 

Into r h T: h " SPeC " 1C SPeCieS ' " " translated 

into the three viral structural proteins, i.e C M „ „ 

isrz 11 as t iarge ——i p^tei^: 

NS5 and a complex set of smaller nonstructural proteins 
It is known that major neutralizing epitopes for 

(1986)). Thus, vaccines may be comprised of recombinant 
polypeptides containing epitopes of HCV E. These 
polypeptides may be expressed in bacteria, yeast „, „ 
malian cells, or alternatively may be isolaw from viral" 
preparations, it is also anticipated that the othL 

20 r"e C to r orore°t elnS ^ ^° """^ ° Plt °**° ^ *~ 
rise to protective anti-HCV antibodies. Thus 

polypeptides containing the epitopes of E C '.nrf » 

abused, whether singly or in ^ZT^ 

» —on w^l T^l^l^ 

r^T tl z a9ainst yeiiow fever ^°^*"T 

(1986 ,. This x. true even though the immunisation does 
not give rise to neutralising antibodies. Thus, 

30 Particularly since this protein appears to be highly 
conserved among Flavi viruses if i„ , •,. , t 
will .l„„ t. ^ iviruses, it is likely that HCV NS1 
it «l Protective against HCV infection. Moreover 

it also shows that nonstructural proteins may provide 
protection against viral pathogenicity, even if they do 

35 not cause the production of neutralising antibodies 
The information provided in the Examples 
concerning the immunogenicity of the polypeptides 



WO 90/11089 _ 44 _ 



PCT/US90/01348 



expressed from cloned HCV cdnas which 

regions of the HCV ORP also alL^ ^ VS * i0aS 

s their use in vaccines. Predictions concerning 

In view of the above, multiv.l= n <. 
against HCV may be comprised of " U " 1Valent vaccinas 
one or more structural^tLI Z7 "™ ^ 
epitopes from one or more nlZ'^Zll Z£ 
vaccines may be comprised of, for exa^l! TheSe 
Polypeptides and/or polypentld.! • ' recombi nant HCV 

virions. In particular vfc ^ ^ the 

comprising o„: or mor of ™ ^ ~» t — l «" 
subunit antigens derived there f ' W protei -- or 

„ NS4 and NS5 . Particularlv 1 , W1 ' C ' NS2 < 

" ing E and,or Hsi, oTsutn^tnerTf "* 

i^unogenJ'por^:^;:, 0 ' VaCCin " Whic " ""tain an 
to one sKillL lithe art in9 ~ di -ts, is Known 

20 P-pared as in^tabW T^TZ^TllT™ "~ 
suspensions; solid forms suitable for sol u tL " " 
suspension in, l iguid prior solution rn, or 

prepared. The p MD „,f- injection may also be 

Protein encapsulated 1 TZ Z ^ * —I-"*-, or the 
25 ingredients are of ten mi^T. ^ ^ Oogenic 
pharmaceutically accept! ! ! eTC1 P ient s which are 
ingredient. ^Labl^^ts «- active 
saline, dextrose, glycerol IT f example , water, 

combinations thereof in n! 1 ' ° r the like —« 
30 cine may container amou^T^^ * — 
such as wetting or emulslfvin„ auxlll ary substances 
agents, and/or adjuvants which T'' P " *""«*». 
°f the vaccine. Examples of ad! effe «-oness 
five include but arHot llltl*™" WhiC " be e ffec- 

35 --^--^i-^e^^^ 

referred to as nor-^P) f I ^ (CGP 11637 ' 

r mdp) r N-acetylmuramyl-L-alanyl-o- 
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TZ7ltT ph °7 loxy> -" hylamine (CGP 1S835A - ~ 

5 extracted ; k ' " hiCh C ° ntainS three =-P-~»t. 

extracted from bacteria, monophosphoryl lipid A , trehalose 
dimycolate and cell wall ske l eton lm ^Jc^ ) la . 2* 
squalene/Tween 80 emulsion. The effectiveness of an 
adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide 
containing an HCV antigenic sequence resulting from 
administration of this polypeptide in vaccines which are 
also comprised of the various adjuvants. 

The vaccines are conventionally administered 

» o"r terally ' ^ *• —Pie, either suocutane- 

ously or intramuscularly. Additional formulations which 
are suitable for other modes of administration incite 
suppositories and, in some cases, oral formulations For 
suppositories, traditional binders and carriers may 

2o include, for example, polyalfcylene glycols or 

triglycerides; such suppositories may be formed from 

n*Tl D r aln f' a " iVe *« the range of 

0.5* to 10%, preferably l*. 2 %. oral formulations include 
such normally employed excipients as, for example, 
2s pharmaceutical grades of mannitol, lactose, starch, 
magnesium stearate, sodium saccharine, cellulose 

taKnH U V" b0nate ' ^ UkS - These "-"Positions 

take the form of solutions, suspensions, tablets, pills 

capsules, sustained release formulations or powders and' 

3Q contain 10%- 95 * of active ingredient, preferably" *!^ 

The proteins may be formulated into the vaccine 

salts'^ 7 f0rmS - ically acceptable 

salts include the acid addition salts (formed with free 
amino groups of the peptide, and which are formed with 
35 inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or such organic acids such as acetic 
oxalic, tartaric, maleic, and the like. Salts formed with 
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5 ammonium, calcium, or'£e«ic X 3 .r^™' ^ssium, 
bases as i^mi^^J^T '' T ~ h 
ethanol. histidine, procain ^ ^j-hylamxno 

22SSa!L ^^2iStrati^^ 

as will be P r 0ph TJZ:iVZZT' ^ ^ ^ <™ 
tive. The quantity to be Lmi„ ! ther apeutically effec- 
m the range of s luj^^ "Tit ^ ^ 
15 Per dose, depends on the sublet to b T"™" ° £ ""^ 
of the subject's immune svsteH tre ated, capacity 

and the degree of v^llTJslTT" 
active ingredient required to hi SCiSe amounts °* 

°n the Judgment of the practitlo admlnistered depend 
20 each subject. P«ct lti oner and may be peculiar to 

schedule, Ir 9 pref«2 , " ay 9lVe " ln a Sin * ie «°se 

-itipxe'dose'sched^:^"""^ 13 *~ * 
vaccination may be „ith l ,n " 3 P rin "ary course of 

„ other doses gi.en at ** 
maintain and or reenforce tne 11 lnterVais re ^red to 
exampie, a t x- 4 months f r 4 se^"^" 85 ' f « 
subsequent dose,s, after several 1 "* " " eedea ' 3 

regimen will also, at least T The aosa * e 

i0 need of the individual and be dHTa deterai rned by the 
of the practitioner. dependent upon the judgment 

In addition, the 
munogenic HCV antigen f., *L3T C ° ntainin ° «w i*. 
-on With other JS^^tSS^r " C °^— 
mune globulins. a * ents ' for example, im . 
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Preparation of a ntibodies Again.- vry Epit , 

d»»r-^K * ^ iTO,uno 9 enic Polypeptides prepared as 

5 ~ , " e USSd t0 produce antibodies, both 

polyclonal and monoclonal. If polyclonal antibodies are 
desxred a selected manual (e.g., mouse, rabbit, goat 
horse, etc., is immunized with an imtanQ P oiyp sptlde 
bearxng an HCV epito P e,s,. serum tlom the 
io anxmal xs collected and treated according to known 

an°HCV U eoI;o " Polyclonal antibodies to 

an HCV epitope contains antibodies to other antigens, the 
polyclonal antibodies can be purified by immunoaff inltv 
chromatography. Techniques for producing and processing 
polyclonal antisera are known in the art, see for example 
Mayer and Walker (1987). =*ampxe, 

isol„-~. /""""^ly- Polyclonal antibodies may be 
with „CV 2 ' "T 1 " hiCh h " mooted 
lo HCV eoitol T P 8 ™ eth0d £ ° r antibodies 

20 H T eP " 0peS fr ° m serum £ *°» infected individual, 
based upon affinity chromatography and utilizing a fus l on 

cTonTs 1 I ^ 3 POl ^"° o-oded within cDNA 

clone 5-1-1, 1S presented in EPO Pub. No. 318,216 

Monoclonal antibodies directed against HCV 

25 the art" *Z ^ * by °" e Sk " le ° 

antinod^ T methodol ogy for making monoclonal 

antxbodxes by hybridomas is well known, immortal 
antibody-producing cell lines can be created by cell fu- 

fo^It and ^ ° ther techni 9U" such as direct trans- 

3Q formation of B lymphocytes with oncogenic dna, or 
transfection with Epstein-Barr virus. See, e.o m 

etT^ao? 1 ' <198 ° ,; Ha — ^etal. ,1981,',: Kennett 
et al. (1980); see also, U.S. Patent Hos. 4,341,761; 

' ' »V 4 ' d "'I 83; 4 ' 444 ' 887 ' 4 < 466 ' 917 ' ^^oo ; 

35 oroducl M "' 9 "' PanelS ° f »°no=lonal antibodies 

produced agaxnst HCV epitopes can be screened for various 
properties; x.e., for isotype, epitope affinity, etc. 
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useful in diagnosis and 1 eplt ° pes *™ Particularly 

= i„ pass 9 ::: 1 ^^:^ 1 ""? « 

in particular, may be used "^ais™ " antib ° die *' 
ies. raise antr-idiotype antibod- 

which carrra"""„t°e^a e i antib ° dieS "™ ^^bulin, 
" infectious^t^ ZT, ^ f «» 

15 are fcnown ITE^T. ^l^*"":" 1 "^ 

15 MacNamara et al ( i 9e4l !'„ SXM1 P le - <*zych (1985), 

treatment and/or diagnosis of s *• 

elucidation of the l—*^^^"*? ~ 

- -u ta t;: r v-: by f 

directed against HCV epitopes Lv be ° -"bodies 

herein, the term ^T"^ *" " Sed 

group of polvceDtido. , v ISrS to a Polypeptide or 
„ antibody IJ^T^^.^^ «* « -ast one 
" "binding domain" is fo™.H 7 antlbod y combining site" or 
-mains of an -Z^Z^T^Z 
dimensional binding spaces with an * 
and charge distributio'n ^L^TSL"^ 
an epitope of an antigen, which allows an t- T" ° f 
reaction with the antiaen , axiows an immunological 

he formed from a ZZ l^oT.T^^ 1 ^ 

VL, respectively, , J ch ZZ hype™ r IabT ^ 

5 -as, C^™-; 
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antibodies, the Fab urotsln« =„.« . • 

iss Proteins, and single domain anttbod- 



5 which ■. T in antibo<1 y" is an antibody 

which is comprised of an VH domain, whioh reaots im- 
munologically with a designated antigen. A dAB does not 
contain a VL domain, but may contain other antigen binding 
domains known to exist in antibodies, for example, the 
kappa and lambda domains. Methods for preparing d^s are 
known in the art. See, for example, ward et al (i^t 
d— • Antlb ° die = "V also be comprised of VH and VL 

Examples of these types of antibodies and methods for 
15 their preparation are known in the art (see, e.g. , „ S 
Patent Ho. 4,816,467. which is incorporated herein by 

"!=TT e> ' ^ inClUdS thS £ °"°»i"9- For example, 
vertebrate antibodies" refers to antibodies which are 
tetramers or aggregates thereof, comprising light and 
20 heavy chains which are usually aggregated In 

configuration and which may or may not have covalent link- 

allnoacT" ChainS - *" antibodies, the 

amino acid sequences of all the chains of a particular 
antibody are homologous with the chains found in one^nti 
2S body produced by the lymphocyte which produce, that any- 
body in situ, or in vitro (for example,, in hybridomas? 
Vertebrate antibodies typicallly include native antibod- 
ies, for example, purified polyclonal antibodies and 
monoclonal antibodies. Examples of the methods for the 
30 preparation of these antibodies are described infra 

oair „ f h " Hybr " antiboaies - «e antibodies wherein one 
first a t^ 3 ll9ht ChainS iS h °^°9°- t= those in a 

chains T s T y : " hlle ° ther P3lr ° f heav * «- "** 
chains is homologous to those in a different second anti- 

35 body Typically, each of these two pairs will bind d Tf. 

resuItsTn'rhr' P " tiCUlarl * °" ^"erent antigens. This 
results in the property of "divalence" , i.e., the ability 
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to bind two antigens simultaneously Suc h i h ... 
also be formed using chineric .J- ^ may 

5 Chimeric antibodies" =,r- Q ' 

£S r- "-^ssl 1 ? which 

are from a different species ^/oTcia^^T^-'T 1 " 3 
i. any antibody in „ hich elther „ J^"* " S ° lncl «^ 
light chains are composed of ! heaVy ° r 

r = W n 3 th e ^srr-^sr- 
::«:::; t ~ sources be or 

„ Point is at the ^iao^r' ?* " "* the f «*°" 

15 « « - _ e var iable/constant boundary Th,,o , 

sequences. I*^^^^*""" 
construct antibodies whose var^M eXamPle ' t0 

,„ specific affinity for 1 ? °" 3 hl9her 

20 constant region can elLf, T anti,en ' ° r Whose 

or to ma K e IZr Z^nTZV """^ 
particular constant region W"" possessed by a 

refers to a^ib^Hn^^ '"^^ ■ <*ich 

5 amino acid ~£Z\?::£J£™™»1 °r Urri " 9 
varied. utilising recombinant DNA tech 

oan be redesigned to obtain desired a "tibodies 
possible variations are many and CharaCteristi o= • The 
, of one or more amino acids to the ? 

region, for examnle th oomplete redesign of a 

r 4. example, the constant reni'nn 

constant region, in general, to attain d„ • ^ **" 

process characteristics e a k desired cellular 

fixation, interact" withmemh ^ *" 

functions, changes ln the ^T' ° tb " 

alter antigen binding characerisrics" 9 ™" "* t0 

also be engineered to aid th ^ ic I 7^ 
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molecule or substance to a specific cell or tissue site 
The desired alterations may be made by known techniques in 
5 molecular biology, e.g., recombinant techniques, site 
directed mutagenesis, etc. 

Yet another example are "univalent antibodies" 
which are aggregates comprised of a heavy chain/light ' 
chain dimer bound to the Fc (i.e., constant) region of a 
io second heavy chain. This type of antibody escapes 

antigenic modulation. See, e.g., Glennie et al . (1982) 

Included also within the definition of antibod- 
ies are "Fab" fragments of antibodies. The "Fab" region 
refers to those portions of the heavy and light chains 
^ which are roughly equivalent, or analogous, to the 

sequences which comprise the branch portion of the heavy 
and lxght chains, and which have been shown to exhibit 
immunological binding to a specified antigen, but which 
lack the effector Fc portion . "Fab" includes aggregates 
2q of one heavy and one light chain (commonly known as Fab') 
as well as tetramers containing the 2H and 2L chains 
(referred to as F(ab) 2 ), which are capable of selectively 
reacting with a designated antigen or antigen family. 
"Fab" antibodies may be divided into subsets analogous to 
25 those described above, i.e, "vertebrate Fab - , "hybrid 
Fab", "chimeric Fab", and "altered Fab". Methods of 
producing "Fab" fragments of antibodies are known within 
the art and include, for example, proteolysis, and 
synthesis by recombinant techniques. 



30 



II,H - Diagnostic Oligonuc leotide Probes and Kii- a 

Using the disclosed portions of the isolated HCV 
cDNAs as a basis, oligomers of approximately 8 nucleotides 
or more can be prepared, either by excision or 
35 synthetically, which hybridize with the HCV genome and are 
useful in identification of the viral agent(s), further 
characterization of the viral genome(s), as well as in 
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detectxon of the viruses, i„ diseased individuals. The 

I 1 f V P ° 1 ^ lao "— (-turai or derived, are a 
s length which allows the detection of unique viral 

TllTT, T hybridi2ation - 6-8 nucleotides may be 

a workable length, sequences of 10-12 nucleotides are 
preferred, and about 20 nucleotides appears optical. 
Preferably, these sequences will derive from regions which 
lack heterogeneity. These probes can be prepared "sine 
routine methods, including automated oligonucleotide 

those a de C m Tf " A " 0ng USefUi Pr ° beS ' f ~ — 
those derived from the newly isolated clones disclosed 

CD^TIV 3 WSl1 " V " iOUS useful in probing 

IS CDBA llb " ri * f °"" "slow. A complement to any 
unrque portion of the HCV genome will be satisfactory^ 
For use as probes, complete complementarity is desirable 

iTLteaseT * ""^ " ^ « «- 

20 looioal « ° £ SU ° h Pr ° beS " dla 9"°«i=s, the bio- 

logical sample to be analyzed, such as blood or serum, may 
be treated, if desired, to extract the nucleic acioT ? 

samel T" 1 "- resulti "9 -id from the 

sample may be subjected to gel electrophoresis or other 

25 ~ h Sepa " tion "Piques; alternatively, the nucleic 
The oT "* bl °" ad " ithout oration . 

for face! 8 " e T" labelSd - SUitatle labels ' and -^ods 
examoi^ ! P "™ k " OWn in thS «*' and . ^r 

tZ v C ^ labelS lncor P°^ed by nick transla- 

3o turn or kinasrng, biotin, fluorescent probes, and 

chemiluminescent probes. The nucleic acids extracted from 
the sample are then treated with the labeled probe under 
hybridation conditions of suitable stringencies, and 
polynucleotide duplexes containing the probe are detected 

35 to the „n, Pr0beS be made co, »P^tely complementary 
to the hcv genome. Therefore, usually high stringency 
condrtxons are desirable in order to prevent false 
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posxtxves. However, conditions of high stringency should 
only be used if the probes are complementary to reg±ons 

5 ITZt gen ° me WhiCh lack h ^rogeneity. The stringency 
of hybrxdxzation is determined by a number of factors dur- 
ing hybridization and during the washing procedure 
xncluding temperature, ionic strength, length of time, and 
concentration of formamide. These factors are outlined 
iQ xn, for example, Maniatis, T. (1982). 

Generally, it is expected that the HCV genome 
sequences will be present in serum of infected individuals 
at relatively low levels, i.e., at approximately 10 2 -10 3 
chxmp infectious doses (CID) per ml. This level may 
is require that amplification techniques be used in 

hybridization assays . Such techniques are known in the 
art For example, the Enzo Biochemical Corporation "Bio- 
Brxdge" system uses terminal deoxynucleotide transferase 
to add unmodified 3 '-poly-dT-tails to a DNA probe. The 
2Q poly dT-tailed probe is hybridized to the target 

nucleotide sequence, and then to a biotin-modif ied poly-A 
PCT application 84/03520 and EPA124221 describe a DNA 
hybridization assay in which: (1, analyte is annealed to a 
sxngle-stranded DNA probe that is complementary to an 
25 ^f^-^eled oligonucleotide; and (2, the resulting 
taxied duplex is hybridized to an enzyme-labeled 
oligonucleotide. EPA 204510 describes a DNA hybridization 
assay in which analyte DNA is contacted with a probe that 
has a tail, such as a P oly-dT tail, an amplifier strand 
3q that has a sequence that hybridizes to the tail of the 

probe, such as a poly-A sequence, and which is capable of 
bxndxng a plurality of labeled strands . A particularly 
desxrable technique may first involve amplification of the 
target HCV sequences in sera approximately 10,000 fold 
35 i.e., to approximately l 0 6 sequences/ml. This may be ac- 

ll7 R \T*t' eXamPle ' by e 01 *— chain reactions 

(PCR) technique described which is by Saiki et al. (i 98 6) 
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by Mullxs, U.S. Patent No. 4,683,195, and by Mullis et al 
U.S. Patent No. 4,683,202. The amplified sequence ' 
then be detected using a hybridisation assay which Is 
described in EP 317,077, published May 24 19 8 9 The 

~r; 0 "T; which shouid deL: -~ h :rthe 

level of 10 /ml, utu.ze nucleic acid multimers which bind 
to srngle-stranded analyte nucleic acid, and which also 
10 hind to a multiplicity of single-stranded lab^ed 

oligonucleotides. A suitabie solution phase sandwich as- 
say which may be used with labeled polynucleotide probes 
and the methods for the Drenaratinn ~* > prooes, 
in EPO 22 5 pn 7 k, ! preparatlon of Probes is described 
m EPO 225,807, published June 16, 1987. 

15 n . The probes can fa e packaged into diagnostic kits 

~ C u kltS ^ ^ DNA, which may be kitS ' 

labeled; alternatively, the probe DNA may be unlabeled and 
the ingredients for labeling may be included in the ^t "n 
separate containers. The kit may also contain other suit- 

2Q ably packaged reagents and materials needed for the 

IT^TTs hyb f di2ati0n *«*~ol. *~ example, standards, 
as well as instructions for conducting the test. 

Immunoassay and Diag n ostic Kits 

25 . . „ B ° th the Polypeptides which react immuno- 
logically with serum containing HCV antibodies, for 
example, those detected by the anti 

de . Pr ,u art . . ¥ tne an tigemc screening method 

described infra, in the Examples, as well those derived 
from or encoded within the isolated clones described In 

3q the Examples, and composites thereof, and the antibodies 
raised against the HCV specific epitopes in these 
polypeptides, are useful in immunoassays to detect pres- 
ence of HCV antibodies, or the presence of the virus L/ 
or viral antigens, in biological samples. Design of the 

3s immunoassays is subject to a great deal of variation and 
a. variety of these are known in the art. For exam^lo rt 
i-unoassay may utili ze one viral epitope 
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the immunoassay may use a combination of viral epitopes 
derived from these sources; these epitopes may be derived 
from the same or from different viral polypeptides, and 
may be in separate recombinant or natural polypeptides, or 
together in the same recombinant polypeptides. it may 
use, for example, a monoclonal antibody directed towards a 
viral epitope(s), a combination of monoclonal antibodies 
directed towards epitopes of one viral antigen, monoclonal 
antibodies directed towards epitopes of different viral 
antigens, polyclonal antibodies directed towards the same 
viral antigen, or polyclonal antibodies directed towards 
different viral antigens. Protocols may be based, for 
example, upon competition, or direct reaction, or sandwich 
type assays. Protocols may also, for example, use solid 
supports, or may be by immunoprecipitation . Most assays 
involve the use of labeled antibody or polypeptide; the 
labels may be, for example, fluorescent, chemiluminescent, 
radioactive, or dye molecules. Assays which amplify the 
signals from the probe are also known; examples of which 
are assays which utilize biotin and avidin, and enzyme- 
labeled and mediated immunoassays, such as ELISA assays. 

Some of the antigenic regions of the putative 
polyprotein have been mapped and identified by screening 
the antigenicitiy of bacterial expression products of HCV 
cONAs which encode portions of the polyprotein. See the 
Examples. Other antigenic regions of HCV may be detected 
by expressing the portions of the HCV cDNAs in other 
expression systems, including yeast systems and cellular 
systems derived from insects and vertebrates. m addi- 
tion, studies giving rise to an antigenicity index and 
hydrophobicity/hydrophilicity profile give rise to 
information concerning the probability of a region's 
antigenicity. 

The studies on antigenic mapping by expression 
of HCV cDNAs showed that a number of clones containing 
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these cONAs expressed polypeptides which were im- 

ZBH° 9i r lly r r Ct±Ve W±th SerUm fr0m ^-iduals with 
5 w^h n Sln9 P ° lypeptide Was ideologically reactive 
wxth all sera. Five of these polypeptides were very im- 
munogenic in that antibodies to the HCV epitopes in these 
polypeptides were detected in many different atient 
although the overlap in detection was not complete Zs, 
the results on the immune genie ity of the polypeptides 
encoded in the various clones suggest that ef fecient 
detection systems may include the use of panels of 
epitopes. The epitopes in the panel may be constructed 
into one or multiple polypeptides. 

15 . «.„ K±tS suitable immunodiagnosis and contain- 
xng the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the 
polypeptides of the invention containing HCV epitopes or 
antibodies directed against HCV epitopes in suitable 
containers, along with the remaining reagents and materi- 
als required for the conduct of the assay, as well as a 
suitable set of assay instructions. 

Further Character i rat i on of the khv „ ^ 

25 V ~ al ™inq P robes Hal ved Fr nm nL* .TTZ^. 
Genome " ■ 

isol ♦ h ^ CDNA SSqUenCe info ^ion in the newly 

isolated clones described in the Examples may be used to 
gam further information on the sequence of the HCV 
3q genome, and for identification and isolation of the HCV 

agent and thus will aid in its characterization including 
the nature of the genome, the structure of the viral 
particle, and the nature of the antigens of which it is 
composed. This information, in turn, can lead to ad- 
35 tiTlZ ^y—^Ld, probes, polypeptides derived from 
the HCV genome, and antibodies directed against HCV 
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epitopes which would be useful for the diagnosis and/or 
treatment of HCV caused NANBH. 

The cDNA sequence information in the above- 
mentioned clones is useful for the design of probes for 
the isolation of additional cDNA sequences which are 
derived from as yet undefined regions of the HCV genome(s) 
from which the cDNAs in clones described herein and in EP 
0,316,218 are derived. For example, labeled probes 
containing a sequence of approximately 8 or more nucleo- 
tides, and preferably 20 or more nucleotides, which are 
derived from regions close to the 5 '-termini or 3 '-termini 
of the composite HCV cDNA sequence shown in Fig. 17 may be 
i5 used to isolate overlapping cDNA sequences from HCV cDNA 
libraries. Alternatively, characterization of the genomic 
segments could be from the viral genome(s) isolated from 
purified HCV particles. Methods for purifying HCV 
particles and for detecting .them during the purification 
2Q procedure are described herein, infra. Procedures for 

isolating polynucleotide genomes from viral particles are 
known in the art, and one procedure which may be used is 
that described in EP 0,218,316. The isolated genomic seg- 
ments could then be cloned and sequenced. An example of 
2s thxs technique, which utilizes amplification of the 

sequences to be cloned, is provided infra., and yielded 
clone 16jh. 

Methods for constructing cDNA libraries are 
known in the art, and are discussed supra and infra; a 
method for the construction of HCV cDNA libraries in 
lambda-gtll is discussed in EPO Pub. No. 318,216 
However, cDNA libraries which are useful for screening 
with nucleic acid probes may also be constructed in other 
vectors known in the art, for example, lambda-gtlO (Huynh 
et al. (1985) ) . 
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Screening for Antl-vir.l „ n . 

The availability of cell culture and animal 
model systems for HCV makes it possible to screen for 
anti-viral agents which inhiblt Hcy MpUM ~ 

preventing viral replication in cell ° n 

=ty-VoT:L^^ 

toxioitvl in L 7 * Pathogenicity (an d a low level of 
13 toxicity) in an animal model system. 

detecting lw CO " POSiti0ns P"™-* herein for 

etectrng HCV antigens and HCV polynucleotides are useful 

alternaT ' °* *» that they provide an 

ina th! ' PerhaPS m ° re ""'"ive means/for detect- 

» zzrjzzz TV,:;'- 1 * repii r on than the :: " 

* 50 assa Y- For example, the HCV- 

polynucleotide probes described herein may be used to 

~ ^ — ■* ° f prodded in a 

hybrioi^n or" 18 COU " *" ^ --pie oy 

25 Hybridization or competition hybridization of the i„f e ct»H 

:robe nUC r elC * labSled HCV-polynucleotide 

to S' . T eXa " PlS ' alS °' anti - BCV -"bodies may be used 
to identify and guantitate HCV antigen, s) in the cell 
culture utilizing the immunoassays described h. 

~ -1^'^: "r y be ---Vto^i^cv ln 

assav thi f Sd CSU CUltUre b * * competition 

assay the polypeptides encoded within the HCV c DMAs 
described herein are useful i, « ™ 
General lv . Z useful in these competition assays. 
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monitored. Moreover, these techniques are particularly 
useful in cases where the HCV may be able to replicate in 
a cell line without causing cell death. 

The anti-viral agents which may be tested for 
efficacy by these methods are known in the art, and 
include, for example, those which interact with virion 
components and/or cellular components which are necessary 
for the binding and/or replication of the virus. Typical 
anti-viral agents may include, for example, inhibitors of 
virion polymerase and/or protease(s) necessary for cleav- 
age of the precursor polypeptides. Other anti-viral 
agents may include those which act with nucleic acids to 
prevent viral replication, for example, anti-sense 
polynucleotides , etc . 

Antisense polynucleotides molecules are 
comprised of a complementary nucleotide sequence which 
allows them to hybridize specifically to designated 
regions of genomes or RNAs. Antisense polynucleotides may 
include, for example, molecules that will block protein 
translation by binding to mRNA, or may be molecules which 
prevent replication of viral RNA by transcriptase. They 
may also include molecules which carry agents (non- 
25 coval ently attached or covalently bound) which cause the 
viral RNA to be inactive by causing, for example, scis- 
sions in the viral RNA. They may also bind to cellular 
polynucleotides which enhance and/or are required for 
viral infectivity, replicative ability, or chronicity. 
3o Antisense molecules which are to hybridize to HCV derived 
RNAs may be designed based upon the sequence information 
of the HCV cDNAs provided herein. The antiviral agents 
based upon anti-sense polynucleotides for HCV may be 
designed to bind with high specificity, to be of increased 
35 solubility, to be stable, and to have low toxicity. 

Hence, they may be delivered in specialized systems, for 
example, liposomes, or by gene therapy. In addition, they 
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ZltZ'Zl™ 1 ? 3 ' at " Ched Pr0teins < —tlt.fi or 
altered bonding between bases, etc. 

5 „oi , 0thSr tYPeS ° f drU9S ^ be b ^=ed upon 
polynucleotides which "m4 m *~». • 

the HCV . important control regions of 

the HCV genome, and which may be therapeutic due to their 

^^rz^7 components of the syst - 

tor viral mfectivity or replication. 
10 General Methods 

, . The general techniques used in extracting the 

genome from « virus, preparing and probing a cdna library 
seguencxng clones, constructing expression vectors' trans! 
15 l°™ n9 CSlls ' P^fo-ing immunological assays Mc * ° 
radioimmunoassays and ELISA assays, for growing „U in 
culture, and the like are known in the art and ,!k \ 
manuals are avaiiable describing these " 
However, as a general guide, the following sets forth 
2fl sources currently available for such procedures , T^IT 
materials useful in carrying them out, 

hfa . j - B ° th P rokar y°tic and eukaryotic host cells may 

appro'riaT ° f d * Si ~ d sequences whef 

appropriate control sequences which are compatible with 

25 T^TZTl h ° St ^ USed ' Photic host e. 

colx xs most frequently used. Expression control ~~ 

sequences for prokaryotes include promoters, optionally 

Tra^fT 9 °?~ P ° rti ° nS ' "* ""^ ^L- 

coZlnT 7 , CO,npatible " ith Prokaryotic hosts are 

30 Z y frora ' for example - pBR322 ^ * PI— 

containing operons conferring ampicillin and tetracycline 
resistance, and the various pUC vectors, which also 

ers ThesrT Miotic resistance mark- 

ers. These markers may be used to obtain successful 
35 transformants by selection. Commonly used prokaryotic 
control sequences include the Beta-lactamase 

(penicillinase) and lactose nmmnf— 

; ana xactose promoter systems (Chang et al. 
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(1977)), the tryptophan (trp) promoter system (Goeddel et 
al. (1980), and the lambda-derived P promoter and N gene 
s rxbosome binding site (Shimatake et al. (1981)) and the 
hybrxd tac promoter (De Boer et al. (1983)) derived from 
sequences of the trp and lac UV5 promoters. The foregoing 
systems are particularly compatible with E. coli- if 
desired, other prokaryotic hosts such as strains 'of Bacil- 
lus or Pseudomonas may be used, with corresponding control 
sequences. * 1 

Eukaryotic hosts include yeast and mammalian 
cells xn culture systems. Saccharamvn— cerevisia e and 
Saccharomyces carlsbergensi s are the most commonly used 

i5 yeast hosts, and are convenient fungal hosts. Yeast 

compatible vectors carry markers which permit selection of 
successful trans formants by conferring prototrophy to 
auxotrophic mutants or resistance to heavy xnetals on wild- 
type strains . Yeast compatible vectors may employ the 2 

2q mxcron origin of replication (Broach et al. (1983)), the 
combination of CEN3 and ARS1 or other means for assuring 
replication, such as sequences which will result in in- 
corporation of an appropriate fragment into the host cell 
genome Control sequences for yeast vectors are known in 

25 tne art and include promoters for the synthesis of 

glycolytic enzymes (Hess et al. (1968); Holland et al. 
(1978),, including the promoter for 3 phosphoglycerate 
kinase (Hitzeman (1980)). Terminators may also be 
included, such as those derived from the enolase gene 

30 ! l^ 1981 ' 1, Particularly useful control systems are 
those which comprise the glyceraldehyde-3 phosphate 
dehydrogenase ( GAPDH ) promoter or alcohol dehydrogenase 

GAPDH rSg " la : able Pr0m0t ~' ^rminators also derived from 
GAPDH, and if secretion is desired, leader sequence from 
35 yeast alpha factor. In addition, the transcriptional 
regulatory region and the transcriptional initiation 
region which are operably linked may be such that they are 
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not naturally associated in the wiln .. 

systems are described in det.l. . ?" type onanism. These 
October 3, 1984; EPO !16 tni tl. ° 12 °' 551 ' ^Wished 

5 - «. is<, 556 , pubiL 1 ^ 0 ^ ssr »• i984; 

«e assigned to the herein assignee and " „ WMCh 
oorporated herein by reference in " 

Mammalian cell lino<? ^ r ^,-i w 
expression are known in the art l d f " h ° StS *" 
10 Ized cell l ines available fro^.K "^"^ ^ ^tal- 

coiiection (ATCC) , ,nZ:d;ziT^Ti;^ r ture 

ovary (C H0, cells, baby hamster ^ " 

number of other cell i ines Stli °? (BHK) C8lls ' and a 

15 " alta " «• also known in the « t P .r ter f f " — 

promoters such as th**- * nd inc ^de viral 

(»7.„. nous s^::" c rr s 40 <sv40) ««- 

bovine papilloma virus (BPV, IL^ ^ ' »* 

-guire terminator seguences "an^yTa^ ^ 
sequences; enhancer sequences which • a<W " lon 
may also be included an!T ""ease expression 

tXon of the gene amplifies- 
are known in the art v^Z """able. These sequences 
-allien cells ^in^lT^ ~* U «»« 
,_ "hich insure integration of It. re P llc °ns, or sequences 

- codmg nanbv epit^rirthTh oTrr se ~ - 

Transformari'^ . * 
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m rane . " OST: genome. 

^reducing ^^T^ ^ ^ ""^ *« 
for example packaging t^CaJ^ 0 ?"' 
transducing a host cell with Sit " " VirUS and 

uptake of the polynucleot^e . ^ZLT ** 
procedure used depends upon ths t« °° 
For example, transformation of the E col 
lambda-gtil containing Wb,^^ 

-idium chloride IS^T^ 
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transformation by direct uptake may be carried out using 
the method of Hinnen et al . (1978). Mammalian transforma- 
tions by direct uptake may be conducted using the calcium 
phosphate precipitation method of Graham and Van der Eb 
(1978), or the various known modifications thereof. 

Vector construction employs techniques which are 
known in the art. Site-specific DNA cleavage is performed 
by treating with suitable restriction enzymes under condi- 
tions which generally are specified by the manufacturer of 
these commercially available enzymes. In general, about 1 
microgram of plasmid or DNA sequence is cleaved by 1 unit 
of enzyme in about 20 microliters buffer solution by 
incubation of 1-2 hr at 37° C. After incubation with the 
restriction enzyme, protein is removed by phenol/ 
chloroform extraction and the DNA recovered by precipita- 
tion with ethanol. The cleaved fragments may be separated 
using polyacrylamide or agarose gel electrophoresis 
techniques, according to the general procedures found in 
Methods in Enzymology (1980) 65:499-560. 

Sticky ended cleavage fragments may be blunt 
ended using E. coli DNA polymerase I (Klenow) in the pres- 
ence of the appropriate deoxynucleotide triphosphates 
(dNTPs) present in the mixture. Treatment with SI 
nuclease may also be used, resulting in the hydrolysis of 
any single stranded DNA portions. 

Ligations are carried out using standard buffer 
and temperature conditions using T4 DNA ligase and ATP; 
sticky end ligations require less ATP and less ligase than 
blunt end ligations. when vector fragments are used as 
part of a ligation mixture, the vector fragment is often 
treated with bacterial alkaline phosphatase ( BAP) or calf 
intestinal alkaline phosphatase to remove the 5 '-phosphate 
and thus prevent religation of the vector; alternatively, 
restriction enzyme digestion of unwanted fragments can be 
used to prevent ligation. 
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clonin. h ^ gati ° n mi * tures are transformed into suitable 
clonxng hosts, such as E. coli, and successful 

5 a"nce Sf !nT ntS -tibiotic resist- 

ance, and screened for the correct construction 

Synthetic oligonucleotides may be prepared usina 
an automated oligonucleotide synthesizer as described 

llTTJ t ] 32 " dSSired thS brands may be 

labeled with *h by treatment with polynucleotide kinase 
m the presence of 32 p-atp 1 Kinase 

the reaction. ' * mtmaaM conditions for 

cDNA , 4h SegUenCeS ' deluding those isolated from 

cDNA Ixbrarxes, may be modified by known techniques, 
i5 including, for example site directed mutagenesis, as 
described by Zoller (1982). Briefly, the DNA to be 
modified is packaged into phage as a single stranded 
sequence, and converted to a double stranded DNA with DNA 
polymerase using, as a primer, a synthetic oligonucleotide 
2Q complementary to the portion of the DNA to be modifx^d 
and havxng the desired modification included in its own 
sequence. The resulting double stranded DNA is 
transformed into a phage supporting host bacterium. 
Cultures of the transformed bacteria, which contain 
25 replxcations of each strand of the phage, are plated in 
agar to obtain plaques. Theoretically, 50% of the new 
Plaques contain phage having the mutated sequence, and the 
remaxning 50% have the oriHnai - 

^ , orxgxnal sequence. Replicates of 

the plaques are hybridized to labeled synthetic probe at 

30 l ^ C ° nditi ° ns which P«it hybridization 

wxth the correct strand, but not with the unmodified 
sequence. The sequences which have been identified by 
hybridization are recovered and cloned. 

DNA libraries may be probed using the procedure 

35 of Grunstein and Hogness (1975). Briefly, in this 

procedure, the DNA to be probed is immobilized on nitro- 
cellulose filters, denatured, and prehybridized with a 
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buffer containing 0-50% formamide, 0.75 M NaCI, 75 mM Na 
citrate, 0.02% (wt/v) each of bovine serum albumin, poly- 
vinyl pyrollidone, and Ficoll, 50 mM Na Phosphate (pH 
5 6.5), 0.1% SDS, and 100 micrograms /ml carrier denatured 
DNA. The percentage of formamide in the buffer, as well 
as the time and temperature conditions of the 
prehybridization and subsequent hybridization steps 
depends on the stringency required. Oligomeric probes 
10 which require lower stringency conditions are generally 
used with low percentages of formamide, lower 
temperatures, and longer hybridization times. Probes 
containing more than 30 or 40 nucleotides such as those 
derived from cDNA or genomic sequences generally employ 
15 higher temperatures, e.g., about 40-42°C, and a high 
percentage, e.g., 50%, formamide. Following 
prehybridization, 5 '- 32 P-labeled oligonucleotide probe is 
added to the buffer, and the filters are incubated in this 
mixture under hybridization conditions. After washing 
20 the treated filters are subjected to autoradiography to 
show the location of the hybridized probe; DNA in cor- 
responding locations on the original agar plates is used 
as the source of the desired DNA. 

For routine vector constructions, ligation 
25 mixtures are transformed into E. coli strain HB101 or 

other suitable host, and successful transf ormants selected 
by antibiotic resistance or other markers. Plasmids from 
the transf ormants are then prepared according to the 
method of Clewell et al. (1969), usually following 
30 chloramphenicol amplification (Clewell (1972)). The DNA 
is isolated and analyzed, usually by restriction enzyme 
analysis and/or sequencing. Sequencing may be by the 
dideoxy method of Sanger et al . (1977) as further 
described by Messing et al. (1981), or by the method of 
35 Maxam et al . (1980). Problems with band compression, 
which are sometimes observed in GC rich regions, were 
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ir'ZleT. ^ T - dea2 ° 9UanOSin9 to Barr et 

The enzyme-linked immunosorbent assay (ELISAI 
can be used to measure either antigen or antibody 
concentrations. This method depends upon conjugation of 

the bTd " eithSr " anti98n " M Mtibod ^ ™ 
the bound enzyme activity as a quantitative labei. To 

iq measure antibody, the known antigen is fixed to a soiid 

test e seru!'dil " iCrOPl " S « Plasti = incubated with 

test serum dilutions, washed, incubated with anti- 
immunoglobulin labeled with an enzyme, and washed again 
Enzymes suitable for labeling are known in the art, and' 
include, for example, horseradish peroxidase. Enzyme 
actrvrty bound to the solid phase is measured by aulng 
the specrfrc substrate, and determining product formation 
ZtZ£T r lliZ " i0n "^^cally. The J££ 
body bound " " dlre " fUnCti ° n ° f thS — «' °* 

20 

To measure antigen, a known specific antibody is 
fixed to the solid phase, the test material containing 
antigen is added, after an incubation the solid phase is 

IfL^asT 3 SSC r antibody is adde! 

25 estlmaZ T is added, and enzyme activity is 

estimated colorimetrically, and related to antigen 
concentration. 9 
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Examples 

Described below are examples of the present 
invention which are provided only for illustrative 

tiT'ln r",rV° ^ ^ SC ° Pe ° f thS 

» 9 " PrSSent disol °=— . numerous 

embodiments within the scope of the claims will be appar- 
ent to those of ordinary skill m the art. 
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Isolation and Sequence of Overlapping 
HCV cDNA Clones 13i. 26^ . CA59a, CA84a, CA156e and CA167H 

The clones 13i, 26 j, CA59a, CA84a, CA156e and 
CA167b were isolated from the lambda-gtll library which 
contains HCV cDNA (ATCC No. 40394), the preparation of 
which is described in EPO Pub. No. 318,216 (published 31 
May 1989), and WO 89/04669 (published 1 June 1989). 
Screening of the library was with the probes described 
infra., using the method described in Huynh (1985). The 
frequencies with which positive clones appeared with the 
respective probes was about 1 in 50,000. 

The isolation of clone 13i was accomplished 
using a synthetic probe derived from the sequence of clone 
12 f. The sequence of the probe was: 

5 ' GAA CGT TGC GAT CTG GAA GAC AGG GAC AGG 3 ' . 



The isolation of clone 26j was accomplished 
using a probe derived from the 5 '-region of clone K9-1. 
The sequence of the probe was: 

5 ' TAT CAG TTA TGC CAA CGG AAG CGG CCC CGA 3 ' . 

The isolation procedures for clone 12 f and for 
clone k9-l (also called K9-1) are described in EPO Pub. 
No. 318,216, and their sequences are shown in Figs. 1 and 
2, respectively. The HCV cDNA sequences of clones 13i and 
26 j, are shown in Figs. 4 and 5, respectively. Also shown 
are the amino acids encoded therein, as well as the 
overlap of clone 13i with clone 12 f, and the overlap of 
clone 26 j with clone 13i. The sequences for these clones 
confirmed the sequence of clone K9-1. Clone K9-1 had been 
isolated from a different HCV cDNA library (See EP 
0,218,316). 
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Clone CA59a was isolated utilizing a probe based 
upon the sequence of the 5 '-region of clone 26 j. The 
sequence of this probe was: 

5 ' CTG GTT AGC AGG GCT TTT CTA TCA CCA CAA 3 ' . 

A probe derived from the sequence of clone CAS9 a 
iQ was used to isolate clone CA84a. The sequence of the 
probe used for this isolation was: 

5 ' AAG GTC CTG GTA GTG CTG CTG CTA TTT GCC 3 ' . 

15 Clone CA156e was isolated using a probe derived 

from the sequence of clone CA84a. The sequence of the 
probe was : 

5 ' ACT GGA CGA CGC AAG GTT GCA ATT GCT CTA 3 ' . 

20 

Clone CA167b was isolated using a probe derived 
from the sequence of clone CA 156e. The sequence of the 
probe was: 

25 5' TTC GAC GTC ACA TCG ATC TGC TTG TCG GGA 3 '. 

The nucleotide sequences of the HCV cDNAs in 
clones CA59a, CA84a, CA156e, and CA167b, are shown Figs. 
6, 7, 8, and 9, respectively. The amino acids encoded 
3o therein, as well as the overlap with the sequences of 
relevant clones, are also shown in the Figs. 

Creation of "pi" H CV cDNA Library 
A library of HCV cDNA, the "pi" library was 
35 constructed from the same batch of infectious chimpanzee 
plasma used to construct the lambda-gtll HCV cDNA library 
(ATCC No. 40394) described in EPO Pub. No. 318,216, and 
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utilizing essentially the same techniques. However, 
construction of the pi library utilized a primer-extension 
method, in which the primer for reverse transcriptase was 
based on the sequence of clone CA59A . The sequence of the 
primer was: 

5 ' GGT GAC GTG GGT TTC 3 ' . 

Isolation and Sequence of Clone pi!4a 
Screening of the "pi" HCV cDNA library described 
supra., with the probe used to isolate clone CA167b (See 
supra.) yielded clone pil4a. The clone contains about 800 
base pairs of cDNA which overlaps clones CA167b, CA156e, 
CA84a and CA59a, which were isolated from the lambda gt-11 
HCV cDNA library (ATCC No. 40394). In addition, pil4a also 
contains about 250 base pairs of DNA which are upstream of 
the HCV cDNA in clone CA167b. 

Isolation and Seq uence of Clones CA216a, CA290a and aq30a 

Based on the sequence of clone CA167b a 
synthetic probe was made having the following sequence: 

5' GGC TTT ACC ACG TCA CCA ATG ATT GCC CTA 3' 

The above probe was used to screen the , which yielded 
clone CA216a, whose HCV sequences are shown in Fig. 10. 

Another probe was made based on the sequence of 
clone CA216a having the following sequence: 

5' TTT GGG TAA GGT CAT CGA TAC CCT TAC GTG 3' 



Screening the lambda-gtll library (ATCC No. 40394) with 
35 this probe yielded clone CA290a, the HCV sequences therein 
being shown in Fig. 11. 
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in a parallel approach, a primer-extension cDNA 
Ixbrary was made using nucleic acid extracted from the 
'JZ ^ 6Ctious Pl«~ used in the original lambda-gtll 
cDNA Ixbrary described above. The primer used was based 
on the sequence of clones CA216a and CA290a: 

5' GAA GCC GCA CGT AAG 3' 

The cDNA library was made using methods similar to those 
descrxbed previously for libraries used in the isolation 
of clones pi!4a and k9-l. The probe used to screen this 
Ixbrary was based on the sequence of clone CA290a: 

5' CCG GCG TAG GTC GCG CAA TTT GGG TAA 3' 

Clone ag30a was isolated from the new library with the 
above probe, and contained about 670 basepairs of HCV 

20 ITlrT' 12 ' Part ° f th±S -erlaps 

the HCV sequence of clones CA216a and CA290a. About 300 

base-pairs of the ag30a sequence, however, is upstream of 

the sequence from clone CA290a. The non-overlapping 

sequence shows a start codon (*, and stop codons that may 

2s xndxcate the start of the HCV ORF . Also indicated in Fig. 

12 are putative small encoded peptides (#) which mav pla y 

a role xn regulating translation, as well as the putative 

first amxno acid of the putative polypeptide (/), and 

downstream amino acids encoded therein. 

30 

Isolation and sequence of clone CA?ns a 
Clone CA205a was isolated from the original 
lambda gt-11 library (ATCC No. 40394), using a synthetic 
probe derived from the HCV sequence in clone CA290a (Fig. 
35 11). The sequence of the probe was: 

5 ' TCA GAT CGT TGG TGG AGT TTA CTT GTT GCC 3 ' . 
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The sequence of the HCV cDNA in CA205a, shown in Fig. 13, 
overlaps with the cDNA sequences in both clones ag30a and 
CA290a. The overlap of the sequence with that of CA290a 
is shown by the dotted line above the sequence (the figure 
also shows the putative amino acids encoded in this frag- 
ment ) . 

As observed from the HCV cDNA sequences in 
clones CA205a and ag30a, the putative HCV polyprotein ap- 
pears to begin at the ATG start codon; the HCV sequences 
in both clones contain an in-frame, contiguous double stop 
codon (TGATAG) forty two nucleotides upstream from this 
ATG. The HCV ORF appears to begin after these stop 
codons, and to extend for at least 8907 nucleotides (See 
the composite HCV cDNA shown in Fig. 17). 

Isolation and Sequence of Clone 18q 
Based on the sequence of clone ag30a (See Fig. 
12) and of an overlapping clone from the original lambda 
gt-11 library (ATCC No. 40394), CA230a, a synthetic probe 
was made having the following sequence: 

5 ' CCA TAG TGG TCT GCG GAA CCG GTG AGT ACA 3 ' . 

Screening of the original lambda-gtll HCV cDNA library 
with the probe yielded clone 18g, the HCV cDNA sequence of 
which is shown in Fig. 14. Also shown in the figure are 
the overlap with clone ag30a, and putative polypeptides 
encoded within the HCV cDNA. 

The cDNA in clone 18g (C18g or 18g) overlaps 
that in clones ag30a and CA205a, described supra. The 
sequence of C18g also contains the double stop codon 
region observed in clone ag30a. The polynucleotide region 
upstream of these stop codons presumably represents part 
of the 5' -region of the HCV genome, which may contain 
short ORFs, and which can be confirmed by direct sequenc- 
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ing of the purified HCV genome. These putative small 
encoded peptides may play a regulatory role in trans la- 
^ tion. The region of the HCV genome upstream of that 

represented by C18g can be isolated for sequence analysis 
using essentially the technique described in EPO Pub. No. 
318,216 for isolating cONA sequences upstream of the HCV 
cDNA sequence in clone 12f . Essentially, small synthetic 

io oligonucleotide primers of reverse transcriptase, which 
are based upon the sequence of C18g, are synthesized and 
used to bind to the corresponding sequence in HCV genomic 
RNA. The primer sequences are proximal to the known 5'- 
terminal of C18g, but sufficiently downstream to allow the 
design of probe sequences upstream of the primer 
sequences. Known standard methods of priming and cloning 
ar eused. The resulting cDNA libraries are screened with 
sequences upstream of the priming sites (as deduced from 
the elucidated sequence of C18g) . The HCV genomic RNA is 

2o obtained from either plasma or liver samples from 

individuals with NANBH. Since HCV appears to be a Flavi- 
like virus, the 5 '-terminus of the genome may be modified 
with a "cap" structure. it is known that Flavivirus 
genomes contain 5 '-terminal "cap" structures. (Yellow 

25 Fever virus, Rice et al. (1988); Dengue virus, Hahn et al 
(1988); Japanese Encephalitis Virus (1987)). 

Isolation and Sequence of Clones from 
the beta-HCV cDNA library 

30 Clones containing cDNA representative of the 3'- 

terminal region of the HCV genome were isolated from a 
cDNA library constructed from the original infectious 
chimpanzee plasma pool which was used for the creation of 
the HCV cDNA lambda-gtll library (ATCC No. 40394), 

35 described in EPO Pub. No. 318,216. In order to create the 
DNA library, RNA extracted from the plasma was "tailed" 
with poly rA using poly (rA) polymerase, and cDNA was 
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synthesized using oligo(dT) 12 _ 18 as a primer for reverse 
transcriptase. The resulting RNA: cDNA hybrid was digested 
with RNAase H, and converted to double stranded HCV cDNA. 
The resulting HCV cDNA was cloned into lambda-gtlO, using 
essentially the technique described in Huynh (1985), 
yielding the beta (or b) HCV cDNA library. The procedures 
used were as follows. 

An aliquot (12ml) of the plasma was treated with 
proteinase K, and extracted with an equal volume of phenol 
saturated with 0.05M Tris-Cl, pH 7.5, 0.05% (v/v) beta- 
mere aptoethanol , 0.1% (w/v) hydroxyquinolone , 1 mM EDTA. 
The resulting aqueous phase was re-extracted with the 
phenol mixture, followed by 3 extractions with a 1:1 
mixture containing phenol and chloroform: isoamyl alcohol 
(24:1), followed by 2 extractions with a mixture of 
chloroform and isoamyl alcohol (1:1). Subsequent to 
adjustment of the aqueous phase to 200 mM with respect to 
20 NaC1 ' nucleic acids in the aqueous phase were precipitated 
overnight at -20°C, with 2.5 volumes of cold absolute 
ethanol. The precipitates were collected by centrif liga- 
tion at 10,000 RPM for 40 rain., washed with 70% ethanol 
containing 20 mM NaCl, and with 100% cold ethanol, dried 
for 5 min. in a dessicator, and dissolved in water. 

The isolated nucleic acids from the infectious 
chimpanzee plasma pool were tailed with poly rA utilizing 
poly-A polymerase in the presence of human placenta 
ribonuclease inhibitor (HPRI) (purchased from Amersham 
Corp.), utilizing MS 2 RNA as carrier. Isolated nucleic 
acids equivalent to that in 2 ml of plasma were incubated 
in a solution containing TMN (50 mM Tris HC1, pH 7.9, 10 
mM MgCl 2 , 250 mM NaCl, 2.5 mM MnCl 2 , 2 mM dithiothreitol 
(DTT) ) , 40 micromolar alpha-[ 32 P] ATP, 20 units HPRI 
(Amersham Corp.), and about 9 to 10 units of RNase free 
poly-A polymerase (BRL) . Incubation was for 10 min. at 
37 C, and the reactions were stopped with EDTA (final 
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concentration about 250 mM) . The so l u n ft n 
with an i i soiutxon was extracted 

W xth an equal volume of phenol -chloroform, and with an 
5 equal volume of chloroform, and nucleic acids were 

ethan^ tate l° Verni9ht '* - 2 °° C With 2 ' 5 of 
ethanol in the presence of 200 mM NaCl. 

Isolation r,f clone bS» 

10 hvbr'H- J 1 " 6 CDNA llbrary Was greened by 

hybrxdxzatxon using a synthetic probe, which had a 

Tris^L^r 71 CDNA SSqUenCe ±n Cl °- 

318,216, and xts sequence is shown in Fig. 3. The 
sequence of the synthetic probe was: 

5' ATT GCG AGA TCT ACQ GGG CCT OCT ACT CCA 3 • . 

Screening of the library yielded clone beta-5a (b5a, 
2fl which contains an HCV cDNA region or approximately 000 

351, 19g, 26g, and 15e (these clones are described suora, 
Region between the 3.-terminal poly-A segued e 
3 -seguence which overlaps clone 15e. contains ap- 

25 loont^ I PalrS - ™ e clone the 

identification of a region of the l- , 

HCV genome. -terminal seguence the 

The seguence of b5a is contained within the 
seguence of the HCV cDNA in clone l 6Jh (described Infra, 
jo Moreover, the seguence is also present in CC34a, isolated 
from the original lambda-gtu library (ATCC Ho. 40 33i T 

r ~rSbraryr da " 9t11 ^ " * " " 
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Isolation and Sequ ence of Clones Generated by PCR 
Amplification of the 3 '-Region of the HCV Genome 

Multiple cDNA clones have been generated which 
contain nucleotide sequences derived from the 3 '-region of 
the HCV genome. This was accomplished by amplifying a 
targeted region of the genome by a polymerase chain re- 
action technique described in Saiki et al. (1986), and in 
Saiki et al. (1988), which was modified as described 
below. The HCV RNA which was amplified was obtained from 
the original infectious chimpanzee plasma pool which was 
used for the creation of the HCV cDNA lambda-gtll library 
(ATCC No. 40394) described in EPO Pub. No. 318,216. 
Isolation of the HCV RNA was as described supra. The 
isolated RNA was tailed at the 3 '-end with ATP by E. coli 
poly-A polymerase as described in Sippel (1973), except 
that the nucleic acids isolated from chimp serum were 
substituted for the nucleic acid substrate. The tailed 
RNA was then reverse transcribed into cDNA by reverse 
transcriptase, using an oligo dT-primer adapter, es- 
sentially as described by Han (1987), except that the 
components and sequence of the primer-adapter were: 

Stuffer NotI SP6 Promoter Primer 

AATTC GCGGCCGC CATACGATTTAGGTGACACTATAGAA T 15 

The resultant cDNA was subjected to amplification by PCR 
using two primers: 

Primer Sequence 
JH32 (30mer) ATAGCGGCCGCCCTCGATTGCGAGATCTAC 
JH11 (20mer) AATTCGGGCGGCCGCCATACGA 

35 The JH32 primer contained 20 nucleotide sequences 

hybridizable to the 5 '-end of the target region in the 
cDNA, with an estimated T m of 66°C. The JH11 was derived 
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from a portion of the oli go dT-primer adapter- thus «* ■ 
specific to the 3 -end of the cDNA with .TV£c " 

5 P T SrS W6re dSSigned to ^ve a recognition site for 

the restriction enzyme, No tI, at the 5<-end, for use in 
subsequent cloning of the amplified HCV cDNA. 

The PCR reaction was carried out by susoendinrr 
the cDNA and the primers in 100 microliters of rea"io7 
mxxture containing the four deoxynucleoside trJLsphLes 
buffer salts and metal ions, and a thermostable oul ' 
polymerase isolated from Thermus a*^,^ (Taq 
polymerase), which are in a Perkin Elmer Cetus PGR kit 
(N801-0043 or N801-0055) . The PCR r> & »t+ 

f n r- r»c , CR reac tion was performed 

15 Each f T 3 Perki " W ~ ? CStUS DM * «e OTa r cyc"r 

Bach cycle consisted of a l . a mln denaturation step at 
y« C, an annealing step at 60°r o 

extension step at 72°c for 3 mL J m1 "' "" 1 ' Pri " er 
.. for 3 nun. The PCR products were 

20 the "Lcerlin^ "'"^ °* " hiCh ***** of 

the -terminal regron of clone 15e. The sequence of JH 34 

5 ' CTT GAT CTA CCT CCA ATC ATT CAA AGA CTC 3- . 

25 The PCR products detected by the HCV cDNA probe ranaed <„ 
srze from about 50 to about 400 base pairs ' 

^ In ° rdeX t0 Cl ° ne the HCV cDNA- the 

PCR products were cleaved with NotI anrt .< , 

i , xul ucjci ana size selectPri hw 

30 LsnaTrs 3 "" 9 T el9 ="°«*»»"^. DNA larger tnan 300 
base parrs was cloned into the NotI site of p DC 18S The 
vector pOCXSS is constructed by including a NotI 

oUcL ™ b9t " een the ECORI and S *" «te s of 

PUC18. The clones were screened for HCV cDNA using the 
3s JH34 probe. A number of positive clones were obtained and 
sequenced. The nucleotide sequence of the HCV cDNHnsert 
m one of these clones, isjh, and the amino acids encoded 
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therein, are shown in Fig. 15. a nucleotide heterogene- 
ity, detected in the sequence of the HCV cDNA in clone 
16 jh as compared to another clone of this region, is 
indicated in the figure. 

Compiled HCV cDNA Sequences 
An HCV cDNA sequence has been compiled from a 
iq series of overlapping clones derived from the various HCV 
cDNA libraries described supra.. In this sequence, the 
compiled HCV cDNA sequence obtained from clones bll4a, 
18g, ag30a, CA205a, CA290a, CA216a, pi!4a, CA167b, CA156e, 
CA84a, and CA59a is upstream of the compiled HCV cDNA 

is sequence published in EPO Pub. No. 318,216, which is shown 
in Fig. 16. The compiled HCV cDNA sequence obtained from 
clones b5a and 16jh downstream of the compiled HCV cDNA 
sequence published in EPO Pub. No. 318,216. 

Fig. 17 shows the compiled HCV cDNA sequence 

2o derived from the above-described clones and the compiled 
HCV cDNA sequence published in EPO Pub. No. 318,216. The 
clones from which the sequence was derived are bll4a, 18g, 
ag30a, CA205a, CA290a, CA216a, pil4a, CA167b, CA156e! 
CA84a, CA59a, K9-1 (also called k9-l),26j, 13i, 12f, 14i, 

2s lib, 7f, 7e, 8h, 33c, 40b, 37b, 35, 36, 81, 32, 33b, 25c! 
14c, 8f, 33f, 33g, 39c r 35f, 19g, 26g, 15e, b5a, and 16jh. 
in the figure the three dashes above the sequence indicate 
the position of the putative initiator methionine codon. 

Clone bll4a was obtained using the cloning 

3q procedure described for clone b5a, supra., except that the 
probe was the synthetic probe used to detect clone 18g, 
supra. Clone bll4a overlaps with clones 18g, ag30a, and 
CA205a, except that clone bll4a contains an extra two 
nucleotides upstream of the sequence in clone 18g (i.e., 

35 5'-CA). These extra two nucleotides have been included' in 
the HCV genomic sequence shown in Fig. 17. 
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It should be noted that although several of the 
clones described supra, have been obtained from libraries 
other than the original HCV cDNA lambda-gtll C librarv 
(ATCC No. 40394), these clones contain HCV cDNA sequelces 
whxch overlap HCV cDNA sequences in the original library 
Thus, essentially all of the HCV sequence is derivable 
from the original lambda-gtll C library (ATCC No. 40394) 
whxch was used to isolate the first HCV cDNA clone (5-1- 

11: 3": 2i6 solation of clone 5 - 1 - 1 is * « Pub. 
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Purification of F u sion Polypeptide Cinn-^ 
15 (Alternate method) 

c mo , fUSi ° n P ° lype P tide ' C1 00"3 (also called HCV 

clOO-3 and alternatively, cl00-3), is comprised of 

Cloned f SmUtaSe (S ° D) ^ N - te - in - - in-frame 

C100 HCV polypeptide at the C-terminus . A method for 

preparing the polypeptide by expression in yeast, and dif- 
ferential extraction of the insoluble fraction of the 
extracted host yeast cells, is described in EPO Pub. No. 
318,216. An alternative method for the preparation of 
thxs fusion polypeptide is described below. m this 
25 method the antigen is precipitated from the crude cell 
lysate with acetone; the acetone precipitated antigen is 

TrlfZsT^S 0 ±0n - eXChan ^ chromatography, and further 
purxfxed by gel filtration. 

The fusion polypeptide, C100-3 (HCV c 100-3) is 
3q expressed in yeast strain JSC 308 (ATCC No. 20879) 
transformed with P AB24C100-3 (ATCC No. 67976); the 
transformed yeast are grown under conditions which allow 

IZTll^l k 1 * 6 " ^ grOWth in YEP cont *ining 1% glucose). 
(See EPO Pub. No. 318,216). A cell l ysat e is prepared by 
35 suspendxng the cells in Buffer A (20 mM Tris HC1, P H 8.0, 

^ ^ * PMSF - The -e broken by grinding 

wxth glass beads in a Dynomill type homogenizer or its 
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equivalent. The extent of cell breakage is monitored by 
counting cells under a microscope with phase optics. 
Broken cells appear dark, while viable cells are light- 
colored. The percentage of broken cells is determined. 

When the percentage of broken cells is ap- 
proximately 90% or greater, the broken cell debris is 
separated from the glass beads by centrif ugation, and the 
glass beads are washed with Buffer A. After combining the 
washes and homogenate , the insoluble material in the 
lysate is obtained by centrif ugation. The material in the 
pellet is washed to remove soluble proteins by suspension 
in Buffer B (50 mM glycine, pH 12.0, 1 mM DTT, 500 mM 
NaCl), followed by Buffer C (50 mM glycine, pH 10.0, 1 mM 
DTT) . The insoluble material is recovered by centrifuga- 
tion, and solubilized by suspension in Buffer C containing 
SDS. The extract solution may be heated in the presence 
of beta-mercaptoethanol and concentrated by 
ultrafiltration. The HCV cl00-3 in the extract is 
precipitated with cold acetone. If desired, the 
precipitate may be stored at temperatures at about or 
below -15°C. 

Prior to ion exchange chromatography, the 
acetone precipitated material is recovered by centrifuga- 
tion, and may be dried under nitrogen. The precipitate is 
suspended in Buffer D (50 mM glycine, pH 10.0, 1 mM DTT, 7 
M urea), and centrif uged to pellet insoluble material. 
The supernatant material is applied to an anion exchange 
column previously equilibrated with Buffer D. Fractions 
are collected and analyzed by ultraviolet absorbance or 
gel electrophoresis on SDS polyacryl amide gels. Those 
fractions containing the HCV c 100-3 polypeptide are 
pooled. 

In order to purify the HCV c 100-3 polypeptide by 
gel filtration, the pooled fractions from the ion-exchange 
column are heated in the presence of beta-mercaptoethanol 
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and SDS, and the eluate is concentrated by 
ultrafiltration. The concentrate is applied to a gel 
filtration column previously equilibrated with Buffer E 
(20 mM Tris HC1, pH 7.0, 1 mM DTT, 0.1% SDS). The pres- 
ence of HCV clOO-3 in the eluted fractions, as well as the 
presence of impurities, are determined by gel 
electrophoresis on polyacryl amide gels in the presence of 
SDS and visualization of the polypeptides. Those frac- 
tions containing purified HCV clOO-3 are pooled. Frac- 
tions high in HCV clOO-3 may be further purified by 
repeating the gel filtration process. if the removal of 
particulate material is desired, the HCV clOO-3 containing 
material may be filtered through a 0.22 micron filter. 

Expression and Antigenicity of Polypeptides 
Encoded in HCV cDNA 
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Polypeptides Expressed in E. coli 
The polypeptides encoded in a number of HCV 
cDNAs which span the HCV genomic ORF were expressed in E. 
coli, and tested for their antigenicity using serum 
obtained from a variety of individuals with NANBH. 
The expression vectors containing the cloned HCV cDNAs 
were constructed from pSODcfl (Steimer et al. (1986). In 
order to be certain that a correct reading frame would be 
achieved, three separate expression vectors, pcflAB, 
pcflCD, and pcflEF were created by ligating either of 
three linkers, AB, CD, and EF to a BamHI-EcoRI fragment 
derived by digesting to completion the vector pSODcfl with 
EcoRI and BamHI, followed by treatment with alkaline 
phosphatase. The linkers were created from six oligomers, 
A, B, C, D, E, and F. Each oligomer was phosphorylated by 
treatment with kinase in the presence of ATP prior to an- 
nealing to its complementary oligomer. The sequences of 
the synthetic linkers were the following. 
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Name DNA Sequence ( 5 ' to 3 ' ) 

A GATC CTG AAT TCC TGA TAA 

B GAC TTA AGG ACT ATT TTA A 

C GATC CGA ATT CTG TGA TAA 

D GCT TAA GAC ACT ATT TTA A 

E GATC CTG GAA TTC TGA TAA 

F GAC CTT AAG ACT ATT TTA A 

Each of the three linkers destroys the original EcoRI 
site, and creates a new EcoRI site within the linker, but 
within a different reading frame. Hence, the HCV cDNA 
EcoRI fragments isolated from the clones when inserted 
into the expression vector, were in three different read- 
ing frames . 

The HCV cDNA fragments in the designated lambda- 
gtll clones were excised by digestion with EcoRI; each 
fragment was inserted into pcflAB, pcflCD, and pcflEF. 
These expression constructs were then transformed into 
D1210 E. coli cells, the transformants were cloned, and 
recombinant bacteria from each clone were induced to 
express the fusion polypeptides by growing the bacteria in 
the presence of IPTG. 

Expression products of the indicated HCV cDNAs 
were tested for antigenicity by direct immunological 
screening of the colonies, using a modification of the 
method described in Helfman et al. (1983). Briefly, as 
shown in Fig. 18, the bacteria were plated onto nitro- 
cellulose filters overlaid on ampicillin plates to give 
approximately 1,000 colonies per filter. Colonies were 
replica plated onto nitrocellulose filters, and the 
replicas were regrown overnight in the presence of 2 mil 
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IPTG and ampicillin. The bacterial colonies were lysed by 
suspending the nitrocellulose filters for about 15 to 20 
min in an atmosphere saturated with CHC1 3 vapor. Each 
filter then was placed in an individual 100 mm Petri dish 
containing 10 ml of 50 mM Tris HC1, pH 7.5, 150 mM NaCl, 5 
mM MgCl 2 , 3% (w/v) BSA, 40 micrograms /ml lysozyme, and 0.1 
microgram/ml DNase. The plates were agitated gently for 
at least 8 hours at room temperature. The filters were 
rinsed in TBST (50 mM Tris HC1, pH8.0, 150 mM NaCl, 0.005% 
Tween 20). After incubation, the cell residues were 
rinsed and incubated in TBS (TBST without Tween) contain- 
ing 10% sheep serum; incubation was for 1 hour. The 
filters were then incubated with pretreated sera in TBS 
from individuals with NANBH, which included: 3 
chimpanzees; 8 patients with chronic NANBH whose sera were 
positive with respect to antibodies to HCV C100-3 
polypeptide (described in EPO Pub. No. 318,216, and 
supra.) (also called C100); 8 patients with chronic NANBH 
whose sera were negative for anti-ClOO antibodies; a 
convalescent patient whose serum was negative for anti- 
C100 antibodies; and 6 patients with community acquired 
NANBH, including one whose sera was strongly positive with 
respect to anti-ClOO antibodies, and one whose sera was 
marginally positive with respect to anti-ClOO antibodies. 
The sera, diluted in TBS, was pretreated by preabsorption 
with hSOD. Incubation of the filters with the sera was 
for at least two hours. After incubation, the filters 
were washed two times for 30 min with TBST. Labeling of 
expressed proteins to which antibodies in the sera bound 
was accomplished by incubation for 2 hours with 125 i- 
labeled sheep anti-human antibody. After washing, the 
filters were washed twice for 30 min with TBST, dried, and 
autoradio graphed . 

A number of clones (see infra.) expressed 
polypeptides containing HCV epitopes which were im- 
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munologically reactive with serum from individuals with 
NANBH. Five of these polypeptides were very immunogenic 
in that antibodies to HCV epitopes in these polypeptides 
were detected in many different patient sera. The clones 
encoding these polypeptides, and the location of the 
polypeptide in the putative HCV polyprotein (wherein the 
amino acid numbers begin with the putative initiator 
codon) are the following: clone 5-1-1, amino acids 1694- 
1735; clone C100, amino acids 1569-1931; clone 33c, amino 
acids 1192-1457; clone CA279a, amino acids 1-84; and clone 
CA290a amino acids 9-177. The location of the immunogenic 
polypeptides within the putative HCV polyprotein are shown 
immediately below. 
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Clone 



CA279a 
CA74a 
131 
CA290a 
33c 
40b 
5-1-1 
81 
33b 
25c 
14c 
8£ 
33f 
33g 
39c 
15e 
C100 



1 

437 
511 

9- 

1192- 
1266- 
1694- 
1689- 
1916- 
1949- 
2054 
2200 
2287 
2348- 
2371. 
2796- 
1569- 



-84 
-582 
-690 
•177 
1457 
1428 
-1735 
-1805 
-2021 
-2124 
-2223 
-3325 
■2385 
■2464 
•2502 
2886 
1931 
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£t efficient detection a „d WmT^ TL^' 
include panels of HCV polypeptides/epitopes * 

35 E xpression of HCV Epitop e . In Tgast 
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ing frames are constructed. The construction of one of 
the vectors, pAB24C100-3 is described in EPO Pub. No. 
318,216. In the studies below, the HCV cDNA from the 
clones listed in supra, in the antigenicity mapping study 
using the coli expressed products are substituted for 
the C100 HCV cDNA. The construction of the other vectors 
replaces the adaptor described in the above E. coli stud- 
ies with one of the following adaptors: 

Adaptor 1 



ATT TTG AAT TCC TAA TGA G 
15 AC TTA AGG ATT ACT CAG CT 

Adaptor 2 

AAT TTG GAA TTC TAA TGA G 
20 AC CTT AAG ATT ACT CAG CT. 

The inserted HCV cDNA is expressed. in yeast transformed 
with the vectors, using the expression conditions 
described supra, for the expression of the fusion 
25 polypeptide, C100-3. The resulting polypeptides are 
screened using the sera from individuals with NANBH, 
described supra, for the screening of immunogenic 
polypeptides encoded in HCV cDNAs expressed in E^ coli . 



Comparison of the Hydrophobic Profiles of HCV 
Polyproteins wit h West Nile Virus Polvprotein 
and with Dengue Virus NSl 
The hydrophobic ity profile of an HCV polyprotein 
segment was compared with that of a typical Flavivirus, 
35 West Nile virus. The polypeptide sequence of the West 
Nile virus polyprotein was deduced from the known 
polynucleotide sequences encoding the non-structural 
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proteins of that virus. The. nrv „„i .. . 

deduced from Polyprotein sequence was 

deduced from the sequence of overlapping cDNA clones. The 

5 uses a w,^ Tr 1 "^ USi,>9 a ° anti9en »"ich 
uses a wmdow of 7 amino acid width (the amino acid in 

questxon, and 3 residues on each side) to report the aver 
age hydrophobic^ about a 9iven ^ actd ^ 
parameters giving the reactive hydrophobicity for each 
ammo acid residue are from Kyte and Doolittle (l 982 , 
Fxg. 19 shows the hydrophobic profiles of the two 
polyproteins; the areas corresponding to the non- 

are indicated in the figure. As seen in the figure, there 
i5 is a general similarity in the profiles of the HCv 
polyprotein and the West Nile virus polyprotein 

The sequence of the amino acids encoded in the 

witTtT ^ CDNA ShOWn ^ 16 ha5 

with the corresponding region of one of the strains of 

20 orofir V T S ' dSSCribed 5Upra - " ith «=P-« to the 

(data not sh e9i ! nS °* hydr< * h ° bi <="r and hydrophilicity 
(data not shown). This comparison indicated that the 
polypeptxdes from HCV and Dengue encoded in this region, 

rne:eofr r n:r ds to , he re9ion enco<un » bsi «« « *** - 

2s thereof), have a sxmxlar hydrophobic/hydrophilic profile 
The similarity in hydrophobicity profiles i„ 
combination with the previously identified homologies in 

E 0™ sMe" Se9UenCeS ° f Md «-^'rus in 

of the p, SU " eS " th « " related to these members 

30 Flavivirus family. 

Within the HCV qrf ' 

thL h .! ^ de8cribed in E *° W. No. 318,216 and 
those described supra. It may be deduced from the 
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sequence that the HCV genome contains primarily one long 
continuous ORF, which encodes a polyprotein. In the 
sequence, nucleotide number 1 corresponds to the first 
nucleotide of the initiator MET codon; minus numbers 
indicate that the nucleotides are that distance away in 
the 5 '-direction (upstream), while positive numbers 
indicate that the nucleotides are that distance away in 
10 3 '" direction (downstream). The composite sequence 

shows the "sense" strand of the HCV cDNA. 

The amino acid sequence of the putative HCV 
polyprotein deduced from the HCV cDNA sense strand 
sequence is also shown in Fig. 17, where position 1 begins 
with the putative initiator methionine. 

Possible protein domains of the encoded HCV 
polyprotein, as well as the approximate boundaries, are 
the following (the polypeptides identified within the 
parentheses are those which are encoded in the Flavivirus 
domain) : 
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Putative Domain Approximate Boundary 

(amino acid nos.) 

"C" (nucleocapsid protein) 1-120 

"E" (Virion envelope protein(s) 120-400 
and possibly matrix (M) 
proteins 

"NS1" (complement fixation 400-660 
antigen? ) 

"NS2 " (unknown function) 660-1050 
"NS3 " (protease?) 1050-1640 
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"NS4 " (unknown function, 1640-2000 
"NS5 " (polymerase) 



2000-? end 



It should be noted, however, that hydrophobic itv nmf,, 
(described infra), indicate that Hc * J^^^ , 
Flavivirus model, particularly with respecl to Z reoion 
upstream of NS2. Moreover, the boundaries indicated Ire 
not intended to show fir* demarcations between the putH 
txve polypeptides. puta- 
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The HydrophUic and AnH^c Pr Q fn 0 ^ ^ 

and th e ! r ° fileS ° f "y^PWlicity/hydrophobicity 
and the antigenic index of the putative polyprotein 

ZllZnt: T CDNA S6qUenCe ShOWl in " -re 

determined by computer analysis. The program for 

hydrophilicity/hydrophobicity was as described supra The 

r^eTon thff IT^ *** * ^ 
relxes on the following criteria: 1) surface probability 

SET °l ! lpha - heliCit ^ * *~ Afferent method;,, 
method IS" ° H f . b6ta - Sheet -^ions by two different 
methods; 4) prediction of U-turns by two different 

ZlllZ - L^T ° Philicit y /h ^-Phobicity; and flex- 
ibility. The traces of the profiles generated by the 
computer analyses are shown in Pig. 20. m the 

^ iU h Ci ? Pr0fi1 *' ^flection above the abscissa 
indicates hydrophilicity, and below the abscissa indicates 

T^ln^l' • Polypeptide reg^ 

antigenic is usually considered to increase when there 



WO 90/11089 



-89- 



PCT/US90/01348 



is a deflection upward from the abscissa in the 
hydrophilic and/or antigenic profile. It should be noted, 
however, that these profiles are not necessarily indica- 
tors of the strength of the immunogenicity of a 
polypeptide . 

Identification of Co-linear Peptides in HCV and 
Flaviviruses 

The amino acid sequence of the putative 
polyprotein encoded in the HCV cDNA sense strand was 
compared with the known amino acid sequences of several 
members of Flaviviruses. The comparison shows that homol- 
ogy is slight, but due to the regions in which it is 
found, it is probably significant. The conserved co- 
linear regions are shown in Fig. 21. The amino acid 
numbers listed below the sequences represent the number in 
the putative HCV polyprotein (See Fig. 17.) 

The spacing of these conserved motifs is similar 
between the Flaviviruses and HCV, and implies that there 
is some similarity between HCV and these flaviviral 
agents . 



The following listed materials are on deposit 
under the terms of the Budapest Treaty with the American 
Type Culture Collection (ATCC) , 12301 Parklawn Dr., 
Rockville, Maryland 20852, and have been assigned the fol- 
lowing Accession Numbers. 
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lambda-qtll 
HCV cDNA library 
clone 81 
clone 91 
clone 1-2 
clone 5-1-1 
clone 12f 
clone 35f 
clone 15e 
clone K9-1 
JSC 308 
pS356 



ATCC No. 

40394 

40388 

40389 

40390 

40391 

40514 

40511 

40513 

40512 

20879 

67683 



Deposit Date 
1 Dec. 1987 
17 Nov. 1987 
17 Nov. 1987 

17 Nov. 1987 

18 Nov. 1987 
10 Nov. 1988 
10 Nov. 1988 
10 Nov. 1988 
10 Nov. 1988 

5 May 1988 
29 April 1988 



In addition, the following deposits were made on 11 May 
1989. 
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Strain 




Linkers 


ATCC No 


D1210 


(Cfl/5-1-1) 


EF 


67967 


D1210 


(Cfl/81) 


EF 


67968 


D1210 


(Cfl/CA74a) 


EF 


67969 


D1210 


(Cfl/35f ) 


AB 


67970 


D1210 


(C£l/279a) 


EF 


67971 


D1210 


(Cfl/C36) 


CD 


67972 


D1210 


(Cf l/13i) 


AB 


67973 


D1210 


(Cfl/C33b) 


EF 


67974 


D1210 


(C£l/CA290a) 


AB 


67975 


HB101 


(AB24/C100 #3R) 




67976 



The following derivatives of strain D1210 were deposited 
on 3 May 1989. 
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Strain Derivative 


ATCC No. 


pCFlCS/C8f 


67956 


pCFlAB/C12f 


67952 


pCFlEF/14c 


67949 


pCFlEF/15e 


67954 


pCFlAB/C25c 


67958 


pCFlEF/C33c 


67953 


pCFlEF/C33f 


67050 


pCFlCD/33g 


67951 


PCF1CD/C39C 


67955 


pCFlEF/C40b 


67957 


pCFlEF/CA167b 


67959 



The following strains were deposited on May 12 , 1989. 



ATCC No. 



Strain 

Lambda gtll(C35) 40603 

Lambda gtlO (beta-5a) 40602 

20 D1210 (C40b) 67980 

D1210 (M16) 67981 



Upon allowance and issuance of this application as a 
United States Patent, all restriction on availability of 
these deposits will be irrevocably removed; and access to 
the designated deposits will be available during pendency 
of the above-named application to one determined by the 
Commissioner to be entitled thereto under 37 CFR 1.14 and 
35 USC 1.22. Moreover, the designated deposits will be 
maintained for a period of thirty (30) years from the date 
of deposit, or for five (5) years after the last request 
for the deposit; or for the enforceable life of the U.S. 
patent, whichever is longer. The deposited materials 
mentioned herein are intended for convenience only, and 
are not required to practice the present invention in view 
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of the descriptions herein, and in addition these materi- 
als are incorporated herein by reference. 

5 

Industrial Applicability 
The invention, in the various manifestations 
disclosed herein, has many industrial uses, some of which 
are the following. The HCV cDNAs may be used for the 

iQ design of probes for the detection of HCV nucleic acids in 
samples . The probes derived from the cDNAs may be used to 
detect HCV nucleic acids in, for example, chemical 
synthetic reactions. They may also be used in screening 
programs for anti-viral agents, to determine the effect of 

i5 the agents in inhibiting viral replication in cell culture 
systems, and animal model systems. The HCV polynucleotide 
probes are also useful in detecting viral nucleic acids in 
humans, and thus, may serve as a basis for diagnosis of 
HCV infections in humans. 

20 In addition to the above, the cDNAs provided 

herein provide information and a means for synthesizing 
polypeptides containing epitopes of HCV. These 
polypeptides are useful in detecting antibodies to HCV 
antxgens. A series of immunoassays for HCV infection, 

25 based on recombinant polypeptides containing HCV epitopes 
are described herein, and will find commercial use in 
diagnosing HCV induced NANBH, in screening blood bank 
donors for HCV-caused infectious hepatitis, and also for 
detecting contaminated blood from infectious blood donors 
3Q The viral antigens will also have utility in monitoring 

the efficacy of anti-viral agents in animal model systems. 
In addition, the polypeptides derived from the HCV cOHAs 
disclosed herein will have utility as vaccines for treat- 
ment of HCV infections . 

35 ThS P^P^ides derived from the HCV cDNAs , 

besides the above stated uses, are also useful for raising 
anti-HCV antibodies. Thus, they may be used in anti-HCV 
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vaccines. However, the antibodies produced as a result of 
immunization with the HCV polypeptides are also useful in 
detecting the presence of viral antigens in samples. Thus, 
they may be used to assay the production of HCV 
polypeptides in chemical systems. The anti-HCV antibodies 
may also be used to monitor the efficacy of anti-viral 
agents in screening programs where these agents are tested 
in tissue culture systems. They may also be used for pas- 
sive immunotherapy, and to diagnose HCV caused NANBH by 
allowing the detection of viral antigen(s) in both blood 
donors and recipients. Another important use for anti-HCV 
antibodies is in affinity chromatography for the purifica- 
tion of virus and viral polypeptides. The purified virus 
and viral polypeptide preparations may be used in vac- 
cines. However, the purified virus may also be useful for 
the development of cell culture systems in which HCV 
replicates . 

Antisense polynucleotides may be used as 
inhibitors of viral replication. 

For convenience, the anti-HCV antibodies and HCV 
polypeptides, whether natural or recombinant, may be pack- 
aged into kits. 
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CLAIMS 



s 1. A recombinant polynucleotide comprising a 

sequence derived from HCV cDNA, wherein the HCV cDNA is in 
clone 131, or clone 26 J. or clone 59a, or clone 84a! or 
clone CA156e or clone 167b, or clone p il4a , or clone 
CA216a or clone CA290a, or clone ag 30a, or clone 205a, or 
w clone 18g, or clone 16jh, or wherein the HCV cDNA is of a 
sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fig. 17. 

2. A recombinant polynucleotide according to 
claim 1, encoding an epitope of HCV. 

3. a recombinant vector comprising the 
polynucleotide of claim 1 or claim 2. 

20 , . , 4 ' A host transformed with the vector of 

claim 3 . 

5. A recombinant expression system comprising 
an open reading frame (ORF) of DNA derived from the re- 
" 1 IT P 01 r^:Leotide of claim 1 or claim 2, wherein 

Tl. 1±nked t0 3 C ° ntro1 Se ^ ence —Pat- 

ible with a desired host. 



30 



6. A cell transformed with the recombinant 
expression system of claim 5. 



6. 



7. A polypeptide produced by the cell of claim 



35 M ^ , 8 ' A P u *i*ied polypeptide comprising an epitope 

encoded within HCV cDNA wherein the HCV cDNA is of a 
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sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fig. 17. 

9 . An immunogenic polypeptide produced by a 
cell transformed with a recombinant expression vector 
comprising an ORF of DNA derived from HCV cDNA, wherein 
the HCV cDNA is comprised of a sequence derived from the 
HCV cDNA sequence in clone CA279a, or clone CA74a, or 
clone I3i, or clone CA290a, or clone 33C or clone 40b, or 
clone 33b f or clone 25c, or clone 14c f or clone 8f, or 
clone 33f , or clone 33g, or clone 39c, or clone 15e, and 
wherein the ORF is operably linked to a control sequence 
compatible with a desired host. 

10. A peptide comprising an HCV epitope, 
wherein the peptide is of the formula 

AA x -AA y , 



wherein x and y designate amino acid numbers shown in Fxg. 
17, and wherein the peptide is selected from the group 
consisting of AA1-AA25, AA1-AA50, AA1-AA84 , AA9-AA177, 
AA1-AA10, AA5-AA20, AA20-AA25, AA35-AA45, AA50-AA100, 
25 AA40-AA90, AA45-AA65, AA65-AA75, AA80-90, AA99-AA120, 
AA95-AA110, AA105-AA120, AA100-AA150, AA150-AA200, 
AA155-AA170, AA190-AA210, AA200-AA250, AA220-AA240, 
AA245-AA265, AA250-AA300, AA290-AA330, AA290-305, 
AA300-AA350, AA310-AA330, AA350-AA400, AA380-AA395, 
30 AA405-AA495, AA400-AA450, AA405-AA415, AA415-AA425, 
AA425-AA435, AA437-AA582, AA450-AA500, AA440-AA460, 
AA460-AA470, AA475-AA495, AA500-AA550, AA511-AA690, AA515- 
^50, AA550-AA600, AA550-AA625, AA575-AA605, AA585-AA600, 
AA600-AA650, AA600-AA625, AA635-AA665, AA650-AA700, 
35 AA645-AA680, AA700-AA750, AA700-AA725, AA700-AA750, 
' AA725-AA775, AA770-AA790, AA750-AA800, AA800-AA815, 
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AA825-AA850, AA850-AA875, AA800-AA850, AA920-AA990, 
AA850-AA900, AA920-AA945, AA940-AA965, AA970-AA990, 
AA950-AA1000, AA1000-AA1060 , AA1000-AA1025 , AA1000-AA1050 , 
AA1025-AA1040, AA1040-AA1055 , AA1075-AA1175 , 
AA1050-AA1200, AA1070-AA1100 , AA1100-AA1130 , 
AA1140-AA1165, AA1192-AA1457 , AA1195-AA1250 , 
AA1200-AA1225, AA1225-AA1250 , AA1250-AA1300 , 
AA1260-AA1310, AA1260-AA1280 , AA1266-AA1428 , 
AA1300-AA1350, AA1290-AA1310 , AA1310-AA1340, AA1345- 
AA1405, AA1345-AA1365, AA1350-AA1400, AA1365-AA1380 , 
AA1380-AA1405, AA1400-AA1450 , AA1450-AA1500, 
AA1460-AA1475, AA1475-AA1515 , AA1475-AA1500 , 
AA1500-AA1550, AA1500-AA1515 , AA1515-AA1550 , 
AA1550-AA1600, AA1545-AA1560, AA1569-AA1931, 
AA1570-AA1590, AA1595-AA1610, AA1590-AA1650, 
AA1610-AA1645, AA1650-AA1690 , AA1685-AA1770 , 
AA1689-AA1805, AA1690-AA1720 , AA1694-AA1735 , 
AA1720-AA1745, AA1745-AA1770 , AA1750-AA1800, 
AA1775-AA1810, AA1795-AA1850, AA1850-AA1900 , 
AA1900-AA1950, AA1900-AA1920 , AA1916-AA2021, 
AA1920-AA1940, AA1949-AA2124 , AA1950-AA2000 , 
AA1950-AA1985, AA1980-AA2000 , AA2000-AA2050 , 
AA2005-AA2025, AA2020-AA2045 , AA2045-AA2100 , 
AA2045-AA2070, AA2054-AA2223 , AA2070-AA2100 , 
AA2100-AA2150, AA2150-AA2200 , AA2200-AA2250 , 
AA2200-AA2325, AA2250-AA2330 , AA2255-AA2270 , 
AA2265-AA2280, AA2280-AA2290 , AA2287-AA2385 , 
AA.2300-AA2350, AA2290-AA2310 , AA2310-AA2330 , 
AA2330-AA2350, AA2350-AA2400 , AA2348-AA2464 , 
AA2345-AA2415, AA.2345-AA2375 , AA2370-AA2410 , 
AA2371-AA2502, AA2400-AA2450 , AA2400-AA2425 , 
AA2415-AA2450, AA2445-AA2500 , AA2445-AA2475 , 
AA2470-AA2490, AA2500-AA2550, AA2505-AA2540 , 
AA2535-AA2560, AA2550-AA2600 , AA2560-AA2580, 
AA2600-AA2650, AA2605-AA2620 , AA2620-AA2650 , 
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AA2640-AA2660, AA2650-AA2700 , AA2655-AA2670 , 

AA2670-AA2700, AA2700-AA2750 , AA2740-AA2760 , 
AA2750-AA2800, AA2755-AA2780 , 

AA2780-AA2830, AA2785-AA2810 , AA2796-AA2886 , 

AA2810-AA2825, AA2800-AA2850 , AA2850-AA2900 , 

AA2850-AA2865, AA2885-AA2905 , AA2900-AA2950 , 

AA2910-AA2930, AA2925-AA2950 , AA2945-end(C ' terminal). 

11. A polypeptide comprised of the peptide of 

claim 10. 

12. An immunogenic polypeptide attached to a 
solid substrate, wherein the polypeptide is according to 
claim 7, or claim 8, or claim 9, or claim 10, or claim 11, 
or wherein the polypeptide is comprised of an epitope 
encoded within HCV cDNA wherein the HCV cDNA is of a 
sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fig. 17. 

13 . A monoclonal antibody directed against an 
epitope encoded in HCV cDNA, wherein the HCV cDNA is of a 
sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fig. 17, or is the sequence present in 
clone 13i, or clone 26 j, or clone 59a, or clone 84a, or 
clone CA156e, or clone 167b, or clone pil4a, or clone 
CA216a, or clone CA290a, or clone ag30a, or clone 205a, or 
clone 18g, or clone 16 jh. 

14. A preparation of purified polyclonal anti- 
bodies directed against a polypeptide comprised of an 
epitope encoded within HCV cDNA, wherein the HCV cDNA is 
of a sequence indicated by nucleotide numbers -319 to 1348 
or 8659 to 8866 in Fig. 17, or is the sequence present in 
clone 13i, or clone 26 j, or clone 59a, or clone 84a, or 
clone CA156e, or clone 167b, or clone pil4a, or clone 
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CA216a or clone CA290a, or clone ag30a, or clone 205a, or 
clone 18g, or clone 16 jh. 

15. A polynucleotide probe for HCV, wherein the 
probe is comprised of an HCV sequence derived from an HCV 
cDNA sequence indicated by nucleotide numbers -319 to 1348 

HCV 8 !nL t0 8866 ^ 1? ' ° r fr ° m thS of the 

HCV cDNA sequence. 

16. A kit for analyzing samples for the pres- 
ence of polynucleotides from HCV comprising a 
polynucleotide probe containing a nucleotide sequence of 
about 8 or more nucleotides, wherein the nucleotide 
sequence is derived from HCV cDNA which is of a sequence 
mdicated by nucleotide numbers -319 to 1348 or 8659 to 
8866 in Fig. 17, wherein the polynucleotide probe is in a 
suitable container. 

17. A kit for analyzing samples for the pres- 
ence of an HCV antigen comprising an antibody which reacts 
immunologically with an HCV antigen, wherein the antigen 
contains an epitope encoded within HCV cDNA which is of a 

25 sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fig. 17, or wherein the HCV cDNA is in 
clone 13i, or clone 26 j, or clone 59a, or clone 84a, or 
clone CA156e, or clone 167b, or clone pii 4 a, or clone 
CA216a or clone CA290a, or clone ag30a, or clone 205a, or 

3q clone 18g, or clone 16jh. 

18. A kit for analyzing samples for the pres- 
ence of an HCV antibody comprising an antigenic 
polypeptide containing an HCV epitope encoded within HCV 
cDNA which is of a sequence indicated by nucleotide 
numbers -319 to 1348 or 8659 to 8866 in Fig. 17, or is in 
clone 13i, or clone 26 j, or clone 59a, or clone 84a, or 
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^a^o 56 ",' " Cl0 " e 16 ' b ' Cl ° ne « 

c^~ I, ° a ' M Cl °" a ag30a ' « 20 5a , or 

clone 18g, or clone 16Jh. 

13. A kit for analyzing samples for the pres- 
ence of an HCV antibody comprising an antigenic 
polypeptide expressed from HCV cDNA in clone CA279a, or 

10 ! °"? CA \ 4a ' " CX ° ne Ui ' ° r clone CUM., or clone 33c 
or clone 40b, or clone 33b, or clone 2Sc, or clone 14c or 
c one 8f, or clone 33f, or clone 33g, or clone 3 9 c, or' 

I suited' Wherei " antl ^ ni = Polypeptide is present in 
a suxtable container. 

15 

20. a method for detecting HCV nucleic acids in 
a sample comprising: 

(a) reacting nucleic acids of the sample with a 
polynucleotide probe for HCV, wherein the probe is 
2q comprised of an HCV sequence derived from an HCV cDNA 

-319 e to e i34 S 8° f indiGated * nucleotide numbers 

react!! °! *° 8866 ^ ^ 1? ' * nd " h ~ ei » 

reacting iS under conditions which allow the formation of 

a polynucleotide duplex between the probe and the HCV 
25 nucleic acid from the sample, 

enn , a . < b) detectin 9 a polynucleotide duplex which 
contains the probe, formed in step (a). 

comprising^' ^ f ~ Meeting an HCV antigen 

an HCV anJ^ inCU * ating a sam P le suspected of containing 
an HCV antigen with an antibody directed against an HCV 
ePxtope encoded in HCV cDNA, wherein the HCV cDNA is of a 
sequence indicated by nucleotide numbers -319 to 1348 or 
8659 to 8866 in Fio 17 r,^ <«, 4-v 
35 ' 13 the s ««J*ence present in 

clone 131 or clone 26 j, or clone 59a, or clone 84a, or 
clone CA156e, or clone 167b, or clone P ii4a, or clone 
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CA216a r or clone CA290a nr ~ 

clone lag, or clone ' * f °™ ag30a ' or c ^ne 205a, or 
y ' CIOne l6 1* f and wherein the incub a i-<n n • 

= r ditions which aiiow °* « 

antxbody complex; and d»i-.„«. ■ antigen- 

complex formed in step i ! antibod y-antigen 

step (a) whlch contains the antibQdy> 

directed a^s^cT^ *" ******* ™*°<"*° 
10 -gainst an HCV antxgen comprising: 

an epitope ± „ ^cO^e, = ~ 

t-xone CAi56e, or clone 167b, or rlon D 

r A?1< ; a . ' or clone pxl4a, or clone 

antibody complex; and antxgen- 

anti-HCV -iS.cS^JLtS^^-.T^ 

epitope encoded in hcv cdna, herein the Hcv ^, ™ 
se 9 „ence indicated ^ „ ucleotlde ^.^^ « 
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8659 to 8866 in Fig. 17 or is the sequence present in 
clone 13i, or clone 26j, or clone 59a, or clone 84a, or 
clone CA156e, or clone 167b, or clone pi!4a, or clone 
CA216a or clone CA290a, or clone ag30a, or clone 205a, or 
clone 18g, or clone 16jh, and wherein the immunogenic 
polypeptide is present in a pharmacologically effective 
dose in a pharmaceutical^ acceptable excipient. 



25. A method for producing antibodies to HCV 
comprising administering to an individual an isolated im- 
munogenic polypeptide containing an HCV epitope encoded in 
HCV cdna, wherein the HCV cDNA is of a sequence indicated 
i5 by nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 
17, or is of the sequence present in clone CA279a, or 
clone CA74a, or clone 13i, or clone CA290a, or clone 33c 
or clone 40b, or clone 33b, or clone 25c, or clone 14c, or 
clone 8f, or clone 33f, or clone 33g, or clone 39c, or 
2q clone 15e, and wherein the immunogenic polypeptide is 
present in a pharmacologically effective dose in a 
pharmaceutical^ acceptable excipient. 

26. An antisense polynucleotide derived from 
2s HCV cDNA, wherein the HCV cDNA is that shown in Fig. 17. 

27. A method for preparing purified fusion 
polypeptide C100-3 comprising: 

(a) providing a crude cell lysate containing 
polypeptide C100-3, 

(b) treating the crude cell lysate with an 
amount of acetone which causes the polypeptide to 
precipitate, 

(c) isolating and solubilizing the precipitated 
material r r 

(d) isolating the C100-3 polypeptide by anion 
exchange chromatography, and 
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«.p «, *L m t ^I ti 1 r atin ' th " C100 - 3 -^. P t ide of 

5 

comprising^"" * ** « HCV polypeptide 

frame (ORF, of DNA derived froml^ C D» tT" "'""I 

10 oDNA is in clone 13i, or clone «f ,' ""^ the HCV 

84a, or clone CA156e Tr clone 16 D "l " 0l ~ 

clone CA2l6a, or clone cJmT ,' * P " 4 "' or 

205a, or clone l 8g « Toll '7* « Cl0ne 
1S i» o f e s.guence Lic.t^"^ " ^ "» aC - «« 

1348 or 86S9 to 8866 in Pic 17 ! f - 3 " t0 

poiypaptid. 29 ^^, 3 ** PMparin9 an HCV 

of a sequence derived from the HCV c^a " C ° mpriS8d 
CA279., or clone CA74a, or SLT1 3L V*™ *" Cl ° n<! 

done 33c, or clone 40o, or clone * '"T^ C * 29 ° a ' " 
30 don. 14c, or clone 6f, 'or Mono 3 T/ O l\l 0 ™ ' " 

linked «' " =1 ° ne 15e ' WherSin ^ ° EP i» oper^ y ° r 
hosTand C00tr01 SSqUenCB haired 

which allow"! inCUOatin « the h °" =«" under conditions 
35 which allow expression of the HCV polypeptide. 

30. A method for preparing a host cell 
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transformed with a recombinant polynucleotide comprisino a 
sequence of hcv cdna derived from the HCV coma i^one 
131 or clone 26 j, or clone 59a, or clone 84a, or clone 
CA156e, or clone 167b, or clone pil4a, or clone L£ T 0 
clone CA290a, or clone ag30a, or clone 205a or cToTibJ 
or clone 16 jh , or wherein the HCV COMA is of a seance " 
indicated by nucleotide numbers -319 to 1348 or 8659 to 
8866 xn Fig. 17 comprising: 

(a) providing a host cell capable of 
transformation; 

providing the recombinant polynucleotide; 



(b) 



and 



15 „h> h ,i inCUbatin * < a > witl » (b) under conditions 

whxch allow transformation of the host cell with the 
polynucleotide. 

31. A method for preparing a recombinant 
2q polynucleotide comprised of a sequence of HCV cDNA derived 
from the HCV cDNA in clone 13i, or clone 26 j, or clone 
59a, or clone 84a, or clone CA156e, or clone 167b, or 
clone pi!4a, or clone CA216a, or clone CA290a, or clone 
ag30a, or clone 205a, or clone 18g, or clone 16 jh, or 
2s wherexn the HCV cDNA is of a sequence indicated by 

nucleotide numbers -319 to 1348 or 8659. to 8866 in Fig. 17 
comprising: 9 

, Pr ° Viding a hoat transformed with the 

recombinant polynucleotide; and 

30 u <b) isolat ing said polynucleotide from said 

host cell. 

32. A method for preparing blood free of HCV 
comprising: 

35 (a) P rovidin 9 « sample of blood suspected of 

containing HCV and anti-HCV antibodies; 
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(b) providing an immunogenic polypeptide 
prepared according to claim 28 or 29; 

5 . (C) incuba ting the sample 'of (a) with the 

immunogenic polypeptide of (b) under conditions w h 
allow the formation of antibody-HCV polypeptide comixes; 
and ^tect^g the complexes formed in step {c) ; 

10 not detectirin 3 ^ 9 ^ ^ ^ "« 

comprising' 3 ' A *" bl °° d f ~ °* «CV 

15 bl QO H J Pr ° Vlding »«l«±c acids from a sample of 

blood suspected of containing HCV polynucleotides^ 

Drobp . (b) P rovidin 9 « Probe for HCV, wherein the 

cDNA which is of a sequence indicated by nucleotide 
2Q numbers -319 to 1348 or 8659 to 8866 in Fig. 17 , 

whi.h n (C> . reaCt±ng < a > with (b) under conditions 
whxch allow the formation of a polynucleotide duplex 

sample? ^ ^ ^ nUClSlC aCid f ™ the 

- the «/i^rs^^r^ which contains 

not detecte^in 8 ^ ^ ^ «- — 

30 . 34. a method for producing a hybridoma which 

produces anti-HCV monoclonal antibodies comprising, 

(a) immunizing an individual with an 
x^unogenic polypeptide containing an epitope encoded in 
HCV cdma, wherein the HCV cdha i. HCV cDNA in cione l3i 
or clone 26,, or clone S9a, or clone 84a, or clone" CA15 6 e 
or clone 167b, or clone pi!4a, or clone CA216a, or c"" 
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CA290a, or clone ag30a, or clone 205a, or clone 18g, or 
clone 16jh r or wherein the HCV cDNA is of a sequence 
indicated by nucleotide numbers -319 to 1348 or 8659 to 
8866 in Fig. 17. or 

(b) immunizing an individual with an 
immunogenic polypeptide prepared according to claim 29; 

(c) immortalizing antibody producing cells from 
the immunized individual; 

(d) selecting an immortal cell which produces 
antibodies which react with an HCV epitope in the 
immunogenic polypeptide of (a) or (b) ; and 

(e) growing said immortal cell. 
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FIG. I Translation of DNA 12f 




181 

ACAGGTGGCCGGAGTAGGTGGAGGTCGTC^ 



3AGGACAAGGAAG 



ACOCCMTTOOTM^^ 



Val 

421 TTGTATC 
AACATAG 



SUBSTITUTE SHEET 
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FIG. 2r\ Translation of DNA k9-l 

GlyCysProGluArgLeuAlaSerCysArgProLeuThrAspPheAspGlnGlyTrpGly 
1 CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC 
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

ProLysProCysGlylleValProAlaLysSerValCysGlyProValTyrCysPheThr 
121 CCCCAAAACCTTGCGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCA 
GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGT 

ProSerProValValValGlyThrThrAspArgSerGlyAlaProThrTyrSerTrpGly 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGG 
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe 
241 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTG 
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

IleGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTGC ACTGCCCCACTGATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAsp 
421 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

TyrProTyrArgLeuTrpHisTyrProCysThrlleAsnTyrThrllePheLysIleArg 
481 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACTATATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGATATAAATTTTAGT 



MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu 
541 GGATGTACGTGGGAGGGGTCGAGCACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTCGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 



ArgCysAspLeuGluAspArgAspArgSerGliiLeuSerProLeuIieuIieuThrThrThr 
601 AACGTTGCGATCTGGAAGATAGGGACAGGTCCG AGCTC AGCCCGTT ACTGCTG ACC ACTA 
TTGCAACGCTAGACCTTCTATCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 



GlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeuIle 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTGCCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGACGGTCGGAACAGGTGGCCGGAGT 

HisLeuHisGlnAsnlleValAspValGlnTyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 

SerTrpAlalleLysTrpGluTyrValVall^uI^uPheLeuLeuLeixAlaAspAlaArg 
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTCCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGCAGGAGGACAAGGAAGACGAACGTCTGCGCG 
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o , ^^y aLc y sSerC ysLeuTrpMetMetLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsn 
841 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAAGCGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTTCGCCGAAACCTCT 



LeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuVal 
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCG 
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 



ae, tT ,^^^S^^^ aTrp ^ rLe ^ ysG1 y L y sTr P ValPr ^ 1 y^ aV alTyrThrPhe 
961 TGTTCTTCTGCTTTGCATGGTATCTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCT 
ACAAGAAGACGAAACGTACCATAGACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA 



1021 TCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG* 



-i m ^ A f P TtoGluVa lAlaAlaSerCysGlyGlyValValLeuValGlyLeuMetAlaLeuThr 

1081 TGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTAA 
ACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGATT 



1141 ctcoxstcacSSttaSag^StatcaStggtcc^ 

GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 



■i m ^ T ^^? ValGluAlaGlhLe uHisValTrpIleProProLeuAsnValArgGlyGlvAra 
1 ^^^^^^"^^^^CAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGC 
ACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 



1261 f^^^^^^^^^^S^S^S^^^^&^OT^^^^ 

cgctgcgacagtagaatgagtacacacgacatgtgggctgagaccataaactgtagtggt 



, „ „ , I^uLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAla 
13 21 AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAG 
TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTC 



FIG. 2-2 
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FIG. 3 

-, ^i^i^Si^ LysArgVal ^ r ^ r ^ uT ^^5AspProThrThrProI^uAlaArqAl^ 
1 CGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGAGC 
GCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCTCG 

c -, ^i^^ 1 " ThrAlaAr 9HisThrProValAsnSerTrpLeuGlyAsnIleIleMetPhe 
61 TG CGTGGG AG ACAG CAAG ACACACTCCAG TCAATTCC TGG CTAGG CAACATAAT CATG TT 

ACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTACAA 



AlaProThrLeuTrpAlaArgMetlleLeuMetThrHisPhePheSerValLeulleAla 
121 TGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATAGC 
ACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCG^GGAATATCG 

o, ^ A ? gAs P GlnLeuGluGlnAlaLeuAs PCysGluIleTyrGlyAlaCysTyrSerIleGlu 
181 CAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATAGA 
GTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCT 

ProLeuAspLeuProProIlelleGlnArgLeu 
241 ACCACTTGATCTACCTCCAATCATTCAAAGACTC 
TGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 



FIG. 5 

Translation of DNA 26 j 

cgaaaagatagtggtgttcaagttgagaagtcS^ 
tgsggaatggctaaaactggtcccgaccccgggataSc^ 

121 cSsassaggas^^ 

gctggtcgcggggatgacgaccgtgatgggg^g^ 

181 GAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTCG^r^rr 
CTCACACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCACCA^ 
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FIG. 4 

Translation of DNA 13i 

' SSBSSaBBSSSSSasaS' 

- JsssssasBBBssgae- 

i2i Tc^lISg^f^fF^i^J^fYaicyssiyAi 




»TGTGGGTCCACGGACCAGC 



GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCGAGT 



~~ ~ Overlap with 12 f 



l rarJ?rSoX al ^ UProCysSerPheT ^Th^LiuP^oAl 
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FIG. 6 



Translation of DNA CA59a 

LeuVa lMetAl aG 1 nLeuLeuAr g 1 1 eP r oG 1 n Al all eLeuAs pMe t IleAlaGlyAla 
. 1 TTGGTAATGGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCT 
AACCATTACCGAGTCGACGAGGCCTAGGGTGTTCGGTAGAACCTGTACTAGCGACCACGA 

HisTrpGlyValLeuAlaGlylleAlaTyrPheSerMetValGlyAsnTrpAlaLysVal 
61 CACTGGGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTC 
GTGACCCCTCAGGACCGCCCGTATCGCATAAAGAGGTACCACCCCTTGACCCGCTTCCAG 

LeuValValLeuLeuLeuPheAlaGlyValAspAlaGluThrHisValThrGlyGlySer 
121 CTGGTAGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGT 
GACCATCACGACGACGATAAACGGCCGCAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCA 

AlaGlyHisThrValSerGlyPheValSerLeuLeuAlaProGlyAlaLysGlnAsnVal 
181 GCCGGCCACACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAG AACGTC 
CGGCCGGTGTGACACAGACCTAAACAATCGGAGGAGCGTGGTCCGCGGTTCGTCTTGCAG 

GlnLeuIleAsnThrAsnGlySerTrpHisLeuAsnSerThrAlaLeuAsnCysAsnAsp 
241 CAGCTGATCAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGAT 
GTCGACTAGTTGTGGTTGCCGTCAACCGTGGAGTTATCGTGCCGGGACTTGACGTTACTA 



.SerLeuAsnThrGlyTrpLeuAlaGlyLeuPheTyrHisHisLysPheAsnSerSerGly 
301 AGCCTCAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGC 
TCGGAGTTGTGGCCGACCAACCGTCCCGAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCG 

Overlap with 26 j 



Overlap with K9-1 

Cy s P r oG 1 uAr gLeuAl aS er Cy s Arg P r o 
361 TGTCCTGAGAGGCTAGCCAGCTGCCGACCCC 
ACAGGACTCTCCGATCGGTCGACGGCTGGGG 
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Translation of DNA CA84a FIG 7 

GlnGlyCysAsnCysSerlleTyrProGlyHisIleThrGlyHisArgMetAlaTrpAsp 
CGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCGCATGGCATGGG 
GCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGCGTACCGTACCC 



MetMetMetAsnTrpSerProThxThrAlaLeuValMetAlaGlnLeuLeuArgllePro 
6 1 ATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATGGCTCAGCTGCTCCGGATCC 
TATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTACCGAGTCGACGAGGCCTAGG 

GlnAlalleLeuAspMetlleAlaGlyAlaHisTrpGlyValLeuAlaGlylleAlaTyr 
121 CACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGAGTCCTGGCGGGCATAGCGT 
GTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCCTCAGGACCGCCCGTATCGCA 

Overlap with CA59a 

PheSerMetValGlyAsnTrpAlaLysValLeuValValLeuLeuLeuPheAlaGlyVal 
181 ATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGTGCTGCTGCTATTTGCCGGCG 
TAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCACGACGACGATAAACGGCCGC 



AspAlaGluThrHisValThrGly 
241 TCGACGCGGAAACCCACGTCACCGGGG 
AGCTGCGCCTTTGGGTGCAGTGGCCCC 



Translation of DNA CA156e F\G 8 

CysTrpValAlaMetThrProThrValAlaThrArgAspGlyLysLeuProAlaThrGln 
1 GTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCCGCGACGCA 
CACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGGCGCTGCGT 

LeuArgArgHisIleAspLeuLeuValGlySerAlaThrLeuCysSerAlaLeuTyrVal 
61 GCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGCCCTCTACGT 
CGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCGGGAGATGCA 

GlyAspLeuCysGlySerValPheLeuValGlyGlnLeuPheThrPheSerProArgArg 
121 GGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTCTCCCAGGCG 
CCCCCTGGATACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAGAGGGTCCGC 

HisTrpThrThrGlnGlyCysAsnCysSerlleTyrProGlyHisIleThrGlyHisArg 
181 CCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCG 
GGTGACCTGCTGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGC 

Overlap with CA84a 

MetAlaTrpAspMetMetMetAsnTrpSerProThrThrAlaLeuValValAlaGlnLeu 
241 CATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAGTGGCTCAGCT 
GTACCGTACCCTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATCACCGAGTCGA 



LeuArglleProGlnAla 
301 GCTCCGGATCCCACAAGCC 
CGAGGCCTAGGGTGTTCGG 
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FIG. 9 



Translation of DNA CA167b 

SerThrGlyLeuTyrHisValThrAsnAspCysProAsnSerSerlleValTyrGluAla 
1 CTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTACGAGGC 
GAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACATGCTCCG 

AlaAspAlalleLeuHisThrProGlyCysValProCysValArgGluGlyAsnAlaSer 
61 GGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAACGCCTC 
CCGGCTACGGTAGGACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTTGCGGAG 



ArgCysTrpValAlaMetThrProThrValAlaThrArgAspGlyLysLeuProAlaThr 
121 GAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCCGCGAC 
CTCCACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGGCGCTG 

: Overlap with CA156e 

GlnLeuArgArgHisIleAspLeuLeuValGlySerAlaThrLeuCysSerAlaLeuTyr 
181 GCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCTACCCTCTGTTCGGCCCTCTA 
CGTCGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGATGGGAGACAAGCCGGGAGAT 



ValGlyAspLeuCysGlySerValPheLeu 
24 1 CGTGGGGGACTTGTGCGGGTCTGTCTTTCTTG 
GCACCCCCTGAACACGCCCAGACAGAAAGAAC 
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FIG. 10 

Translation of DNA ssCA216a 




121 



181 



241 



301 



GCGTACCGCAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCC^GG 
CGAGAAAGAGATAGAAGGAAGACCGGGACGAGAGAACGAACTGACACGGG^ 

™^™2^ gAsnSerTtoG1 y^ u ^ H ^^ 

TGGTTCACGCGTTGAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGAMGA^ 

~~r~ ~~~7" 7 overlap with CAl67b 

aacacatgcttcgccggctacggtaggacgtgtgaggccccacgSS 

G i vA ? ^i a SerArgCysTrpValAlaMetThrProThrValAlI 
3 61 agggcaacgcctcgaggtgttgggtggcgatgacccctacggtggcc 
tcccgttgcggagctccacaacccaccgct 
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FIG. II 

Translation of DNA ssCA290a 

LysLysAsnLysArgAsnThrAsnArgArgProGlnAspValliysPheProGlyGlyGly 
1 AAAAAAAAAACAAACGTAACACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCG 
TTTTTTTTTTGTTTGCATTGTGGTTGGCAGCGGGTGTCCTGCAGTTCAAGGGCCCACCGC 

GlnlleValGlyGlyValTyrLeiiLeuProArgArgGlyProArgLeuGlyValArgAla 
6 1 GTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCG 
CAGTCTAGCAACCACCTCAAATGAACAACGGCGCGTCCCCGGGATCTAACCCACACGCGC 

ThrArgLysThrSerGluArgSerGlnProArgGlyArgArgGlnProIleProLysAla 
121 CGACGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGTAGACGCCAGCCTATCCCCAAGG 
GCTGCTCTTTCTGAAGGCTCGCCAGCGTTGGAGCTCCATCTGCGGTCGGATAGGGGTTCC 

ArgArgProGluGlyArgTh3rTrpAlaGlnProGlyTyrProTrpProLeuTyrGlyAsn 
181 CTCGTCGGCCCGAGGGCAGGACCTCGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCA 
GAGCAGCCGGGCTCCCGTCCTGGACCCGAGTCGGGCCCATGGGAACCGGGGAGATACCGT 

GluGlyCysGlyTrpAlaGlyTrpLeuLeuSerProArgGlySerArgProSerTrpGly 
241 ATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGG 
TACTCCCGACGCCCACCCGCCCTACCGAGGACAGAGGGGCACCGAGAGCCGGATCGACCC 



ProThrAspProArgArgArgSerArgAsnLeuGlyLysVallleAspThrLeuThrCys 
301 GCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGT 
CGGGGTGTCTGGGGGCCGCATCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCA 



GlyPheAlaAspLeuMetGlyTyrlleProLeuValGlyAlaProLeuGlyGlyAlaAla 
361 GCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTG 
CGCCGAAGCGGCTGGAGTACCCCATGTATGGCGAGCAGCCGCGGGGAGAACCTCCGCGAC 

overlap with CA216a 

ArgAlaLeuAlaHisGlyValArgValLeuGluAspGlyValAsnTyrAlaThrGlyAsn 
421 CCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGA 
GGTCCCGGGACCGCGTACCGCAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCCT 



LeuProGlyCysSerPheSerThrPhe 
481 ACCTTCCTGGTTGCTCTTTCTCTACCTTC 
TGGAAGGACCAACGAGAAAGAGATGGAAG 
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FIG. 12 | #MetSerValValGlnProProGlyProProLeu 

#MetAlaLeuValOP 

1 CGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCC 
GCGTCTTTCGCAGATCGGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGG 

ProGlyGluProAM 

6 1 TCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGAC 
AGGGCCCTCTCGGTATCACCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTG 

#MetProGlyAspLeuGlyValProProGlnAsp 

121 CGGGTCCTTTCTTGGATCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGA 
GCCCAGGAAAGAACCTAGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTT'CT 

OP AM GlyAlaCys 
CysAM * 

181 CTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTT 
GACGATCGGCTCATCACAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAA 

I 

GluCysProGlyArgSerArgArgProCysThrMetSerThrAsnProLysProGlnLys 

241 GCGAGTGCCCCGGGAGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAA 
CGCTCACGGGGCCCTCCAGAGCATCTGGCACGTGGTACTCGTGCTTAGGATTTGGAGTTT 

LysAsnLysArgAsnThrAsnArgArgProGliiAspValLysPheProGlyGlyGlyGln 

301 AAAAAAACAAACGTAACACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTC 
TTTTTTTGTTTGCATTGTGGTTGGCAGCGGGTGTCCTGCAGTTCAAGGGCCCACCGCCAG 

IleValGlyGlyValTyrLeuLeuProArgArgGlyProArgLeuGlyValArgAlaThr 

361 AGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGA 
TCTAGCAACCACCTCAAATGAACAACGGCGCGTCCCCGGGATCTAACCCACACGCGCGCT 

ArgLysThrSerGluArgSerGlnProArgGlyArgArgGlnProIleProLysAlaArg 

421 CGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTC 
GCTCTTTCTGAAGGCTCGCCAGCGTTGGAGCTCCATCTGCAGTCGGATAGGGGTTCCGAG 

ArgProGluGlyArgThrTrpAlaGlnProGlyTyrProTrpProLeuTyrGlyAsnGlu 

overlap with CA290a 

481 GTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATG 
CAGCCGGGCTCCCGTCCTGGACCCGAGTCGGGCCCATGGGAACCGGGGAGATACCGTTAC 

GlyCysGlyTrpAlaGlyTrpLeuLeuSerProArgGlySerArgProSerTrpGlyPro 

541 AGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCC 
TCCCGACGCCCACCCGCCCTACCGAGGACAGAGGGGCACCGAGAGCCGGATCGACCCCGG 

ThrAspProArgArgArgSerArgAsriljeuGlyLysVallleAspThrLeuThrCysGly 
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601 CCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCG 
GGTGTCTGGGGGCCGCATCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCACGC 

Phe 



661 GCTTC 
CGAAG 



* - Start of long HCV ORF 

I - Putative first amino acid of large HCV polyprotein 

# - Putative small encoded peptides (that may play a 

translational regulatory role) 



FIG. 12-2 
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FIG. 13 

Translation of DNA CA205a 



61 



121 



241 



301 



* - putative initiator methionine codon 



SUBSTITUTE SHEET 



WO 90/11089 



PCT/US90/01348 



14/ 40 



FIG. 14 

Translation of DNA 18g 



#ProProOP 

#22525^^ Pro 
GAGGTGGTACTTAGTGAGGGGACAcS 



#MetSerValValGlnProProGlyProProLeuProGlyGluProAM 
MetAlaLeuValOP 

61 ^^^^^^^^^^^ 



121 ^^^^^^^^^^^^ 

overlap with ag30a 



#MetProGlyAspLeuGlyValProProGlnAspCy 



sAM 



181 sssssss»sagassggssEssssssg^gsg3ssgsssss 



OP AM 



GlyAlaCysGluCysProGlyArgSer 



241 SSSSSSSSSSSS^^ 

ArgArg 



301 CTCGTAGA 
GAGCATCT 



* = Start of long HCV ORP 

# - Putative small encoded peptides (that mav 

play a translational regulatory role) 
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FIG. 15 



Translation of DNA 16 jh 
—-—Overlap with 15e 



Gly* 

121 C?^f^OT?rr^ 1PrOPr °^^AlaTrplrgHi i 




CGTGGCCCGGGCCTCGCAGGCG 



... AlaValArgThrLysLeuLys 
241 GCAGTAAGAACAAAGCTCAAAC 
CGTCATTCTTGTTTCGAGTTTG 



nucleotide heterogeneity 
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COMBINED ORF OF DNAs pil4a THROUGH 15e 

(pil4a/CA167b/CA156e/CA84a/CA59a/K9-l/12f/14i/llb/7f/7e/ 

8h/33c/40b/37b/35/36/81/32/33b/25c/14c/8f/33f/33g/39c/ 
35f/19g/26g & 15e) 

•■ asssassaBsessa 
" mmtBSBBBssmm 
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Thr FIG 16—2 

84 1 £^GTGTCTGGATTTCTTA^ 

TGACACAGACCTAAACAATCGGAGGAGCGTGG^ 

sol sssssss^^ 

T TG TCGT TGCCGXCAAC?G^G C G C A«^^ 

~ aassssaaBBassBs 

1081 ScSSgS^^ 

" aaaaBaassaaaBaaB 

"" MBWBBHBBSBHaaB 

1741 J^^g^^Sg|||^^«Il ? a.s.ri r pAla 1 l. 

isoi ^«S^?^^^^^ss^ 
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PIP I A— "5* 

TTCACCCTCATGCAGCAAGAGGACAAGG^GACGAACGTCTGCGCGCGCAGACGAGGACG 



1861 TtSSSSSSbcS^ 



1981 



IBagSassSSEBi 

*•" ssassassaasssasa 



2221 



- ^^^^ 

- mssseesssessss 

mES&BBBB&SSB&i 



2761 



asBssHssasasBSBss 
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ValGluGlvGluValfil nTlpis 

2821 



2881 



294 



FIG. 16-4 

CACCTCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAG 
TTACCCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTC 
CCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGC 

gttccatcggcgagtaactgtgggacgtgaacgccgaggag^tmwStggwcagtgc 

TCCGTGCGGCTACAGIAA^aC^CCKC^ASG^CCCG^ii^Glc^ 
GGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCC^GCGACAAC^^GGCGCCCC 

1 ^^sm^^S^g 



3061 



3121 



3181 



324 



3301 



3361 



CCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCC^TATTCCACGATCATGAG 
342! ^I^I^SlAAcSS^^^SSG^IIlGAic 

CTAGGATTGTAGTCCI^CCCCACTCTTGTTAATGGTGAWGT^GGOTAGTCCATGAGG 

TGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAA 

ACACTGCTCACGGTGAGGIGCCTACGGTGTAGGTAGAACCCGTAGCCGTCA^GGAACTG 

GTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTCGCGGTCGGGAGGCCCGAGG 
G^^SSl?l?AlS^S?^^^5Si^J^S 



3541 
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FIG 16— "5 

AAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACA 

eAsnAla 

GTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGG 
CACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTG^CGCCGCTACA^AGCA^ 

caccgttggctacgggagtactggccgatatggccgctgaagctgagccactatS 
ttatgcacacagtgggtctgtcagctaaagtcg^^ 

saaBBBsa 



- ssBsasamssBssssm 
•**' aSS&BsesassBEsm 

- smsBsaasssasBss 




444 

CGATCCCGAGTTCGGGGAGGGGGTAGC^CCCTGG^CTACAC^^ 

TTCGGGTGGGAGGTACCCGGTTGTGGGGACGMA^CTC^ 
4561 AraKCTCACsrirrrSr SIS * ysTyr I3 - e MetThrCy sMetS erAlaAspLeuGlu 

tagtcggac ^cgtgggtcagtggtttatgtama1tc?acw^ 

4621 ' 5S5SES^ 

cagcagtgctcgtggacccacgagcaa^ 

46S1 gKS^^ 

gacagttgtccgacgcaccagtatcac^ 

4741 iSSSKSSi^^ 

tatggactgtcccttcaggagatgg^^ 
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PIG I C 6 

4801 TT^ GG ?^TCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGC^ 

AATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCG 

. LeuLeuGlnThrAlaSerAr 9GlnAlaGluValIleAlaProAlaValGlnThrAsnTrD 

4661 ^3££SE8ffi&g&S3£g^^jggS8& 

GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATG 
AACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGT 

50,1 i^SS^SSSS^S^SSSSS^SSSS^SS^S 

CGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGG^GGAGAAGTTGTATA^CCCC^ 

5101 ^^^^SSS^SSESffiS^SSS 

ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 




5161 CGACCG-^^^ 

5221 asssssssgs^^ 

ATACCGCGCCCGCACCGCCCTCGAGAACACCGT^^ 

= SH?lF G1 ^ s P^ uValAs nLeuLeuProAlaIleLeuSerProGlvAlaLeuValVal 
5281 TCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCC^GA^C^CGTArTr 
AGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTC 

CCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCC^ 

e , „, 2£ pMe ^ snAr 9 LeuIle AlaPheAlaSerArgGlyAsnHisValSerProThrHtsTvT- 
5401 f^ATCAACCGGCTGATAGCCTT^ 

ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTCCGTCATG 

a ac, X^^^^?^ rAspAlaAlaAlaAr 5ValThrAlaIleLeuSerSerLeuThrValThr 
5461 GTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCT^CTGTAACC 
CACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGG 

5521 GAGGTGC T^GCGACTG^ 

GTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCA 

5581 TCCTGGCTA^GGACATCTCGGACTGgS^ 

AGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACC 

e « ■. ^^ysAiaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGly 
5641 CTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGG 
GATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCC 

HM Ty. rL y sG lyValTrpArgValAspGlyIleMetHisThrArgCysHisCysGlyAlaGlu 
5701 TATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGAG 
ATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACTC 

IleThrGlyHisValLysAsnGlyThrMetArglleValGlyProAraThrCvsAroAsn 
5761 ATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAAC 
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TAGrGACCTGTACAGTTTTTGCC^TGCTAC^CT^CAGCCAGGATCCTGGACGTCCTTG 

MetTr PSerGlyThrPheProIleAsnAlaTyrThrThrGlyProCysThrProLeuPro 
5821 ATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCCT 
TACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGGA 

5881 GCGCCG AACTACACGTTCGCGCTATGGAGGGT^ 

CGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCC 

, GlnValG1 y As P p heHisTyrValThrGlyMetThrThrAspAsnLeuLysCvsPrc3Cvs 
5941 CAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGC 
GTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACG 

6001 

GTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGC 

6061 




GGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTATG 



S"Y^^ySe^lnI^uProCysGluProGluProAspValAlaValLeuThrSerMet 

6121 ^^^SSS^S^SSSSS^^^S 

6181 sssssgsg^^ 

GAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGC^CCAACCGCTCCC^ 




6241 . _ _ _ *„»ww*v^ i-«xv- ■ iV.v_fl.xu-J l-UAAGGCAACT 

GGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGA 

6301 ^^^SSBS^SSSSSSgSSS3SaSJSSS3 

ACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCC 

GTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTG 

«a -51 P^ oLeuV alAlaGluGluAspGluArgGluIleSerValProAlaGluIle 
6421 TCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATC 
AGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAG 

c „ o ^^!^2^H SerArgAr 5PheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsn 
6481 CTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAAC 
GACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTG 




6541 — „™™ wwwv ■ v , yi , .,,..^. r ... rGT 

GGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACA 

,* J£<£«*ProProPrdLy sSerProProValProProProArgLysLysArgThrValVal 
6601 CCGCTTCC^CCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTC^ 

GGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAG 

I^uThrGluSerThrLeuSerThrAlaLetiAlaGluLeuAlaThrArgSerPheGlvSer 
6661 CTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGCAGC 
GAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTCG 

SerSerTltt SerGlyIleThrGlyAspAsnThrThrThrSerSerGluProAlaProSer 
6721 TCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTCT 
AGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAGA 
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FIG 16—8 

CCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTC 



GGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGKAGT^TCACTCCGGTTGCGC 

ctcctacagcacacgacgagitacagaatgagaacctgtccgcgtoakmtcmgca?§ 



cggcgccttcttgtctitgacgggtagttacgtgaitcgttgagcaacgatccIgtcgt§ 

7141 H^^^^S^^B 

^^P^^^A?^K^^CGiAA«J? 

ttcggtcgagcagagtagcacaaggggctagacccgcaS55g^55S^"a1?Ia 

7441 ^^^^BSSSSSSSffiiSSgfflSsaEg^ 

AA<*TGCTGCACCAATGMTCGAGGGGAACCGGCACTACCCTTCGAGM^ 



7501 3|S5p[SSESi^^ 

atgagtggtcctgtcgcccaacttaa^cacgttcgcacc^Sotc^ggggt 
taccccmgagcatactatgggcgacgaaactgaggtgt^Sactctcgctgtaggca 

7621 ^^^^^ssa^saa^sKBSggasssgasssgssK 

TGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTTC 

agggagtggctctccgaaatacaacccccgggagaatggttaag^ 
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FIG. 16-9 

CCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGA 

, on , SysTyrlleLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeu 
i 801 TGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTC 
ACG ATG T AGTTC CGGGCCCG TCGG AC AG CTCGG CG TCC CG AGGTCC TG ACG TGG TACG AG 

ValCysGlyAspAspLeuValVallleCysGluSerAlaGlyValGlnGluAspAlaAla 
7 861 GTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCG 
CACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGC 

SerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaProProGlyAspProPro 
7921 AGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCCA 
TCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGGT 

GlnProGlu TyrAspLeuGluLeuIleThrSerCysSerSerAsnValSerValAlaHis 
7981 CAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCAC 
GTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGTG 

AspGlyAlaGlyLysArgValTyrTyrLeuThrArgAspProThrThrProLeuAlaArg 
8041 GACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGA 
CTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCT 

« , „ , AlaAla TrpGluThrAlaArgHisThrProYalAsnSerTrpLeuGlyAsnIleIleMet 
8101 GCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATG 
CGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTAC 

PheAlaProThrLeuTrpAlaArgMetlleLeuMetThrHisPhePheSerValLeuIle 
8161 TTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATA 
AAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATAT 

AlaArgAspGlnLeuGluGlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerlle 
8221 GCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATA 
CGGTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTAT 

GluProLeuAspLeuProProIlelleGlnArgLeu 
8281 G AACCACTTG ATCTACCTCCAATCATTCAAAGACTC 
CTTGG TGAACTAGATGGAGGTT AGTAAG TTTCTG AG 
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F| G. 17-1 

-319 CACTCCACCATGAATCACTCCCCTGTGAGGAACTACTGTCTTCACGCAGAAAGCGTCTAG 
GTGAGGTGGTACTTAGTGAGGGGACACTCCTTGATGACAGAAGTGCGTCTTTCGCAGATC 

-259 CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGGAGGGCCCTCTCGGTAT 

-199 GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
CACCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCT 

-139 TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
AGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACGATCGGCTCATCA 

- 7 9 GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG 

CAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTCACGGGGCCCTC 

- 19 GTCTCGTAGACCGTGCACC 

CAGAGCATCTGGCACGTGG 

Arg Thr 

MetSerThrAsnProLysProGlnLysLysAsnLysArgAsnThrAsnArgArgProGln 
1 ATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAACACCAACCGTCGCCCACAG 
TACTCGTGCTTAGGATTTGGAGTTTTTTTTTTGTTTGCATTGTGGTTGGCAGCGGGTGTC 

AspValLysPheProGlyGlyGlyGlnlleValGlyGlyValTyrLeuLeuProArgArg 
61 GACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGG 
CTGCAGTTCAAGGGCCCACCGCCAGTCTAGCAACCACCTCAAATGAACAACGGCGCGTCC 

GlyProArgLeuGlyValArgAlaThrArgLysThrSerGluArgSerGlnProArgGly 
121 GGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGT 
CCGGGATCTAACCCACACGCGCGCTGCTCTTTCTGAAGGCTCGCCAGCGTTGGAGCTCCA 

ArgArgGlnProIleProLysAlaArgArgProGluGlyArgThrTrpAlaGlnProGly 
181 AGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGG 
TCTGCAGTCGGATAGGGGTTCCGAGCAGCCGGGCTCCCGTCCTGGACCCGAGTCGGGCCC 

TyrProTrpProLeuTyrGlyAsnGluGlyCysGlyTrpAlaGlyTrpLeuLeuSerPro 
241 TACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCC 
ATGGGAACCGGGGAGATACCGTTACTCCCGACGCCCACCCGCCCTACCGAGGACAGAGGG 

ArgGlySerArgProSerTrpGlyProThrAspProArgArgArgSerArgAsnLeuGly 
301 CGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGT 
GCACCGAGAGCCGGATCGACCCCGGGGTGTCTGGGGGCCGCATCCAGCGCGTTAAACCCA 

LysVallleAspThrLeuThrCysGlyPheAlaAspLeuMetGlyTyrlleProLeuVal 
361 AAGGTCATCGATACCQTTACGTGCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTC 
TTCCAGTAGCTATGGGAATGCACGCCGAAGCGGCTGGAGTACCCCATGTATGGCGAGCAG 

GlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValArgValLeuGluAsp 
421 GGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGAC 
CCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACCGCAGGCCCAAGACCTTCTG 

Thr 

GlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerllePheLeilLeuAla 
481 GGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTTCTCTATCTTCCTTCTGGCC 
CCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAAGAGATAGAAGGAAGACCGG 

LeuLeuSerCysLeuThrValProAlaSerAlaTyrGlnValArgAsnSerThrGlyLeu 
541 CTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGTGCGCAACTCCACGGGGCTT 
GACGAGAGAACGAACTGACACGGGCGAAGCCGGATGGTTCACGCGTTGAGGTGCCCCGAA 

TyrHisValThrAsnAspCysProAsnSerSerlleValTyrGluAlaAlaAspAlalle 
601 TACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTACGAGGCGGCCGATGCCATC 
ATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACATGCTCCGCCGGCTACGGTAG 

LeuHisThrProGlyCysValProCysValArgGluGlyAsnAlaSerArgCysTrpVal 
661 CTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAACGCCTCGAGGTGTTGGGTG 
GACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTTGCGGAGCTCCACAACCCAC 
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FIG.. 17=2 

™, ^^ etThrp roThrValAlaThrArgAspGlyLysLeuProAlaThxGlnLeuArgArq 
721 GCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCCGCGACGCAGCTTCGACGT 
CGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGGCGCTGCGTCGAAGCTGCA 

„ „ , ? i 5 IleAspLe, ^ uValG1 y Ser ^ aTtoL euCysSerAlaLeuTyrValGlyAspLeu 
781 CACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGCCCTCTACGTGGGGGACCTA 
GTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCGGGAGATGCACCCCCTGGAT 

« . , c y sG1 y SerVa lPheLeuValGlyGlnLeuPheThrPheSerProArgArgHisTrpThr 
841 TGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTCTCCCAGGCGCCACTGGACG 
ACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAGAGGGTCCGCGGTGACCTGC 

ThrGlnGlyCysAsnCysSerlleTyrProGlyHisIleThrGlyHisArgMetAlaTrp 
901 ACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCGCATGGCATGG 
TGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGCGTACCGTACC 



Val 



AspMetMel^etAsnTrpSerProThrThrAlaLeuValMetAlaGlnLeuLeiiArgile 
961 GATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATGGCTCAGCTGCTCCGGATC 
CTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTACCGAGTCGACGAGGCCTAG 

-, n ^ ^°G lnAlaIleL euAspMetIleAlaGlyAlaHisTrpGlyValLeuAlaGlyIleAla 
1021 CCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGAGTCCTGGCGGGCATAGCG 
GGTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCCTCAGGACCGCCCGTATCGC 

-■ « « , ^ rPheS ^ MetVal GlyAsnTrpAlaI.y sValLeuValValLevLLeiiLeuPheAlaGlv 
1081 TATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGTGCTGCTGCTATTTGCCGGC 
ATAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCACGACGACGATAAACGGCCG 

Y a ^ s P^ aGluThrHis ValThrGlyGlySerAlaGlyHisThrValSerGlyPheVal 
1141 GTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCACACTGTGTCTGGATTTGTT 
CAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCACGGCCGGTGTGACACAGACCTAAACAA 

SerLeuLeuAlaProGlyAlaLysGlnAsnValGlnLeuIleAsnThrAsnGlySerTrp 
1201 AGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGATCAACACCAACGGCAGTTGG 
TCGGAGGAGCGTGGTCCGCGGTTCGTCTTGCAGGTCGACTAGTTGTGGTTGCCGTCAACC 

HisLeuAsnSerThrAlaLeuAsnCysAsnAspSerLeuAsnThrGlyTrpLeiaAlaGly 
1261 CACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAACACCGGCTGGTTGGCAGGG 
GTGGAGTTATCGTGCCGGGACTTGACGTTACTATCGGAGTTGTGGCCGACCAACCGTCCC 

- LeupheT y r HisHisLysPheAsnSerSerGlyCysProGluArgLeiiAlaSerCvsAra 
13 21 CTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGA 
GAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCGACAGGACTCTCCGATCGGTCGACGGCT 

ProLeuThrAspPheAspGlnGlyTrpGlyProIleSerTyrAlaAsnGlySerGlyPro 

13 81 CCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTATGCCAACGGAAGCGGCCCC 

GGGGAATGGCTAAAACTGGTCCCGACCCCGGGATAGTCAATACGGTTGCCTTCGCCGGGG 

AspGlnArgProTyrCysTrpHisTyrProProLysProCysGlylleValProAlaLys 

14 4 1 GACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTGCGGTATTGTGCCCGCGAAG 

CTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAACGCCATAACACGGGCGCTTC 

SerValCysGlyProValTyrCysPheThrProSerProValValValGlyThrThrAsp 
1501 AGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTGGTGGTGGGAACGACCGAC 
TCACACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCACCACCACCCTTGCTGGCTG 

ArgSerGlyAlaProThrTyrSerT3^>GlyGlxiAsnAspThxAspValPheValLeuAsn 

15 61 AGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATACGGACGTCTTCGTCCTTAAC 

TCCAGCCCGCGCGGGTGGATGTCGACCCCACTTTTACTATGCCTGCAGAAGCAGGAATTG 

AsnThrArgProProLeuGlyAsnTrpPheGlyCysThrTrpMetAsnSerThrGlyPhe 
1621 AATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTGGATGAACTCAACTGGATTC 
TTATGGTCCGGTGGCGACCCGTTAACCAAGCCAACATGGACCTACTTGAGTTGACCTAAG 
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Leu 




GGGACCTAGTGTGGGTCCACGGACCAGCTGATGGGCATATCCGAAACCGT^TAGGAACA 

i92i ^SSaSSSSSSS^SSSBaa^ss^^^^ 

:tgtccctgtccagg 




TMAGGGTTCGCCTCCGCCGAAACCTCTTGG&GCAT^TG^TI^GTCGTAGGGACCGG 
CCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTAC^TAA^MCCCA 

Asn 

2521 i™*^^^ 

accacgaacaccaccgaagtcataaaagactggtctcaccttcgcgttcacctgcacaot 

taagggggggagttgcaggctccccccgcgctccggcagtagaatcaSaca 
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2641 ^~ssssas^S^s£^sag^ 



TAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGC^G^ 




^GACGCGCAATCGCGCCTTCTACTA^^ 




2821 

l^TCCCCGCGAATGACCGTGGATACAAATATTGGTAGAG^G^ 




GGGCACGCGGCCGCCCCACTATCGTC^ 

3481 raSESira 
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360, asagsgsgsssgga^gSissggs*-* 1 *^ 

GTC^GGTCCACCGAGTGG^Sg^^g^^C^GTC 
GGCCGACGTATACGTCGAGTCC^S^^i^ii^^ 

3781 ^^^^sgsgssssssisasgsgg^ 



TGTCACCCGAAACCACGl^TGTACAGGTTCC^G^TCCTAGCTAGGATTCTAGTCCT^ 



~ saassEfflass 




4021 

-gaggcagtgacacggggtaggg 
^ccgttggctacgggag 



4321 



4381 A^SS?™?^^^ 



4441 




CGACAGAGGGTOIGAGTIG-^SGTCSTCA^Sa^g^^^A 
4501 ^SSG^SIiS^SfSl^l^^Pl^^CysGluCys 

SUBSTITUTE SHEET OM * c * CTCKG 
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4801 



4861 



4921 




GTGGTTTATGTAGTACTGTACGTAO^GCCGGCTGGACCTC^G^GTCCTCGTGGACC 



ACTGTCCCTTCAG 

5ioi cSSES^ 



CAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGG 



GACGGACCATTGGGGCGGTAACGAAGTAACTATCGAA^TCTCG^GA^GTCGTCGGGT 

540! SSS^^S^^~^ 
GATTGGTGATCGGTTTGGGAGGAGAACTTCTATAACTCCCCCACCCACCG 

GGGCGGGGGCCACGGCGATGACGGAAACACCCGCG^CGAATCGACCGCGGCGCTAGCTC 
SerValGXyLeuGlyLysValLeuIleAspIleLeuAlaGlyTyrGlyAlaGlyValAla 

SUBSTITyTl SHEET 
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576i ^S^S^^2^2S^?ISS^^ ProGlus ^^ 

5821 ^S^^S^^H^J^S^^^uftrgtogLeu 

STCGCGCCCATATTCCCCCAGACCGCT 

Gly 




6121 iS^^SSS^SS^^^^^^^^rn^^ 

6181 r?Si^^i^^^?^il!???y sThr ProLeuPr( 



6241 ^S??S^S^5ii?Sj!;S^Ir£Y?WluIleArg G lnVal G lvAsoPh e Hl„ 

alGlySerGlnLeu 
CGGTAGGGTCGCAATTA 



6421 ^G^aSg^^ 
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Ser 




PheAla 

714! ^^mS^SaSa^S^^S^^^SS^^*^ 



2CTAGAA 

SUBSTITUTE SHSBT 
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Phe 



7741 



7801 




TTACCCCAAGAGCATACTA 



7921 



™if'^ heAs PSerThrvalThrGluSerA S 



^GGACAGCTCGGCGTCCCGaIgSS^ 

SUBSTITUTE SHEET mtctcc 
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ValTyrTyrLeuThrArgAspProThrThrProLeuAlaArgAlaAlaTrpGluThrAla 
8401 GTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCA 
CAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTCTGTCGT 

ArgHisThrProValAsnSerTrpLeuGlyAsnllelleMetPheAlaProThrLeuTrp 
8461 AGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGG 
TCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGGGTGTGACACC 

AlaArgMetlleLeuMetThrHisPhePheSerValLeuIleAlaArgAspGlnLeuGlu 
8521 GCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATAGCCAGGGACCAGCTTGAA 
CGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATATCGGTCCCTGGTCGAACTT 

GlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuPro 
8581 CAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATAGAACCACTTGATCTACCT 
GTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGA 

ProllelleGlnArgLeuHisGlyLeuSerAlaPheSerLeuHisSerTyrSerProGlv 
864 1 CCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACTCCACAGTTACTCTCCAGGT 
GGTTAGTAAGTTTCTGAGGTACCGGAGTCGCGTAAAAGTGAGGTGTCAATGAGAGGTCCA 

GluIleAsnArgValAlaAlaCysLeuArgLysLeuGlyValProProLeuArgAlaTrp 
8701 GAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGTACCGCCCTTGCGAGCTTGG 
CTTTAATTATCCCACCGGCGTACGGAGTCTTTTGAACCCCATGGCGGGAACGCTCGAACC 

Gly 

^f^™^^S S ^ rValAr 5AlaArgLeuLeuAlaArgGlyGlyArgAlaAlaIle 
8761 AGACACCGGGCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAGAGGAGGCAGGGCTGCCATA 
TCTGTGGCCCGGGCCTCGCAGGCGCGATCCGAAGACCGGTCTCCTCCGTCCCGACGGTAT 

CysGlyLysTyrLeuPheAsnTrpAlaValArgThrLysLeiiLys 
8821 TGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCTCAAAC 
ACACCGTTCATGGAGAAGTTGACCCGTCATTCTTGTTTCGAGTTTG 



FIG. 17-10 
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Some conserved co-lin ear peptides in HCV & Flaviviruses 
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SELF -ASSEMBLING POLYNUCLEOTIDE DELIVERY SYSTEM 

Technical Field 

This invention is in the field of oligonucleotide 
delivery and gene therapy. In particular this invention is 
5 directed to a self -assembling polynucleotide delivery 
system comprising components aiding in the delivery of the 
polynucleotide to the desired address which are associated 
via noncovalent interactions with the polynucleotide. The 
components of this system include DNA-masking components, 
10 cell recognition components, charge-neutralization and 
membrane-permeabilization components, and subcellular 
localization components . 

Background Art 

Cystic Fibrosis (CF) is a fatal recessive genetic 

15 disease characterized by abnormalities in chloride 
transport (McPherson & Dorner, 1991) . The locus of the 
disease has been traced to mutations in the gene encoding 
the cystic fibrosis transmembrane conductance regulator 
(CFTR) . J.R. Riordan et al., Science (1989) 245:1066-1073; 

20 B. Kerem et al . , Science (1989) 245 : 1073 -1080 . Correction 
of the underlying gene defect by complementation or 
replacement of the defective CFTR is the ultimate cure for 
CF. Gene therapy, the in vivo delivery and expression of 
genes, is a fast -developing science that can be used to 

25 replace defective genes. 

Gene therapy is already feasible. T. Friedmann, 
Science (1989) 244:1275-1281; M. Bluestone, Biotechnol 
(1992) 10.: 132-134. Systems and polymers for delivery of 
polynucleotides are known in the art. P.L. Feigner, Adv 

30 Drug Delivery Rev (1990) 5:163-187. Adenoviral vectors 
have been used to transfer CFTR to the cotton rat lung in 
vivo. M. A. Hosenfeld et al . , Cell (1992) 68.: 143-155 . 
Although high levels of transfection in vivo have been 
reported with the adenoviral vectors, non-viral delivery 

3 5 systems have a number of advantages and should be 



WO 93/19768 



PCT/US93/03406 



vigorously developed. Rosenfeld et al., supra ; M.A. 

Rosenfeld et al. r Science (1991) 252:431-434. 

During the past decade, a number of methods have been 

developed to introduce functional genes into mammalian 
5 cells in vitro. These techniques are applicable to gene 

therapy if the target cells can be removed from the body, 

treated, and the transfected cells amplified and then 

returned to the patient. This option is not possible for 

CF patients. At present the best in vivo transfection 
10 efficiencies are obtained with retroviruses (Bluestone, 

supra ) and adenoviruses (Rosenfeld et al . , supra ) . However 

the efficiency is variable and a concern is that virus 

based gene delivery might cause viral infection or cancer. 

Initial human clinical trials have revealed no acute 
15 complications of retroviral vectors but the possibility of 

long-term complications mandate careful patient monitoring. 

K. Coriietta et al.. Human Gene Ther (1991) 2,: 3 -14. 

The risks of using viral based vectors and the 

conceptual advantages in using plasmid DNA constructs for 
20 gene therapy (discussed in P.L. Feigner et al., Nature 
(1991) 349:351-352) have led to a parallel development of 

various physical and chemical methods for gene transfer. 

The most intensely studied systems involve treatment of the 

cells with calcium phosphate or a cationic facilitator 
25 (Feigner et al., supra ) . Other popular methods involve DNA 

injection during physical puncture of the membrane (M.R. 

Capecchi, Cell (1980) 22:479-485) or passive uptake during 

permeabilization or abrasion of the cellular membrane 
(Feigner et al. , supra ) . Each method is intrinsically 
3 0 aggressive and applicable only in vitro. 

This invention is in the field of direct gene delivery 

that does not involve the use of viral vehicles. 

A non-viral carrier for gene delivery must be able to 

surmount many barriers: it must survive in circulation, it 
35 must be able to tairget the cell of choice, it must be able 

to introduce DNA into the cytoplasm, and it must be able to 

transport the DNA into the nucleus . 
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Masking . One concern about the direct delivery of 
genes in vivo is the ability of the polynucleotide to 
survive in circulation long enough to arrive at the desired 
cellular destination. "Masking", or protecting the poly- 
5 nucleotides is one way to address this concern. 

Microparticulates (such as the erythrocyte ghost, 
reconstituted viral envelopes and liposomes) have been used 
in part as protection in gene transfer. C. Nicolau et al., 
Crit Rev Ther Drug Carr Svs (1989) 6:239-271; R.J. Mannio 
10 et al., Biotechnioues (1988) 6:682-690. The most 

successful liposome system uses the cationic lipid reagent 
dioleyloxytrimethylammonium (DOTMA). P.L. Feigner et al., 
Proc Natl Acad Sci (USA) (1987) 84 : 7413-7417 . DOTMA is 
mixed with phosphatidylethanolamine (PE) to form the 
15 reagent Lipof ectin™. The advantage of using Lipof ectin'* is 
that the cationic liposome is simply mixed with the DNA and 
added to the cell. It is not necessary to encapsulate the 
DNA inside of the liposome with the cationic reagents. 
Lipof ectin™ has been used to transfect reporter genes into 
20 human lung epithelial cells in culture (L. Lu et al., 
Pfluaers Arch (1989) 415:198-203), to introduce the CAT 
gene into rats by intratracheal route (T.A. Hazinski et 
al., Am J Respir Cell Mol Biol (1991) 4:206-209) and to 
introduce the CAT gene into mice by the intratracheal and 
25 intravenous route (K.L. Brigham et al . , Am J Med Sci (1989) 
298 :278-281; A. Bout et al . , "Abstracts of the 1991 Cystic 
Fibrosis Conference", Abstract no. 87 (1991)). About 50% 
of the airway epithelial cells transiently expressed the S 
galactosidase reporter gene (Hazinski et al., supra) but 
3 0 the level of expression was not quantitated. When 
chloramphenicol acetyltransf erase (CAT) attached to a 
steroid sensitive promoter was transfected into rat lung, 
expression could be positively regulated by dexamethasone . 
Hazinski et al., supra . Cytotoxicity is a problem with high 
3 5 concentrations of Lipof ectin™. 

Substitutes for DOTMA include lipopolyamine (J. 
Loeffler et al., J Neurochem (1990) 54:1812-1815), 
lipophilic polylysines (X. Zhou et al . , Biochim Biophys 
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Acta. (1991) 1065:8-14 ) and a cat ionic cholesterol (X. Gao 
et al. f Biochem Biophvs Res Comm (1991) 179:280-285), 
These have been used to mediate gene transfer in culture. 
Although there is some improvement over transfection rates 
S observed with Lipofectin w (about threefold) , toxicity 
remains a problem. Studies on the mechanism responsible 
for transfection using the cationic lipids have been 
notably lacking. The past approach has been to synthesize 
different cationic lipids and try them in transf ection 

10 assays, rather than to systematically study how the 
delivery systems introduce DNA into the cell. DOTMA/PE 
liposomes can undergo bilayer fusion with anionic liposomes 
(N. Duzgunes et al., Biochem (1989) 28:9179-9184) which 
suggests that direct entry of the DNA via the plasma 

15 membrane is involved with DOTMA' s mode of action. High 
efficiency transfection using cationic lipids systems 
requires the inclusion of PE, possibly because PE can form 
intramembrane lipid intermediates which facilitate membrane 
fusion. The role of PE in membrane permeabilization and 

20 fusion has been extensively studied. E.g., M.-Z. Lai et 
al. f Biochem (1985) 24:1646-1653; H. Ellens et al . , Biochem 
(1986) 25:285-294f J. Bentz et al. , Biochem (1987) 2j6:2105- 
2116) . 

Cellular Targeting . Efficient gene transfer requires 
25 targeting of the DNA to the cell of choice. Recently, 
procedures based upon receptor mediated endocytosis have, 
been described for gene transfer.. G.Y. Wu et al . , J~ Biol 
Chem (1987) 262:4429; G.Y. Wu et al., J Biol Chem (1988) 
263:14621-14624. A cell-specific ligand-polylysine complex 
3 0 is bound to nucleic acids through charge interactions. The 
resulting complex is taken up by the target cells. Wu et 
al., supra . reported efficient transfection of the human 
hepatoma cell line HepG2 and of rat hepatocytes in vivo 
using this delivery system with asialoorosomucoid as a 
35 * ligand. Huckett et al, , Biochem Pharmacol (1990) 40.: 253- 
263 r reported stable expression of an enzymatic activity in 
HepG2 cells following insulin- directed targeting. Finally 
Wagner et al., Proc Natl Acad Sci (USA) (1990) 87.: 3410-3414 
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and (1991) 88:4255-4259 observed trans ferrin- poly cat ion - 
mediated delivery of a plasmid into the human leukemic cell 
line K-562 and subsequent expression of the encoded 
lucif erase gene. However, the described delivery systems 
5 are based upon high molecular weight targeting proteins 
linked to DNA through a polylysine linker. These large 
ligand-polycation conjugates are heterogenous in size and 
composition, not chemically well-defined, and difficult to 
prepare in a reproducible fashion (Wu et al . , supra : Wagner 
10 et al . , supra) . Moreover, in many of the receptor mediated 
systems, chloroquine or other disruptors of intracellular 
trafficking are required for high levels of transf ection. 
In one study, adenovirus has been used to enhance gene 
delivery of the receptor mediated systems. D.T. Curiel et 
15 al., Proc Nat l Acad Sci (VGA) (1991) 88:8850-8854. 

Together these studies show that genes can be delivered 
into the interior of mammalian cells by receptor mediated 
endocytosis and a fraction of the exogenous DNA escapes 
degradation, enters the nucleus, and is expressed. The 
level of expression is low, probably due to the limited 
entry of the foreign DNA into the cytoplasm. 

Charge Neutraliza tion and Membrane Permeabilization . 
Direct delivery of genes is aided by the ability to 
neutralize the large negative charge on the polynucleotide, 
25 and the (often concomitant) ability to permeabilize the 
membrane of the targeted cell. The use of polycations to 
neutralize the polynucleotide charge and aid in the 
membrane permeabilization and translocation is well known. 
Feigner, supra . Cat ionic lipids have also been used for 
3 0 this purpose. P.L. Feigner et al . , Proc Natl Acad Sci 
iUSAL (1987) 84:7413-7417; U.S. Patent No. 4,946,787 to 
Eppstein et al. Certain cationic lipids termed 

lipopolyamines and lipointercalants are also known. J. -P. 
Behr, Tet Lett (1986) 27.:5861-5864 . 
35 Subcellular Loc alization . Once the polynucleotide has 

entered the targeted cell, direct delivery of genes would 
be aided by the ability to direct the genes to the proper 
subcellular location. One obvious target for the delivery 
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of deoxyribonucleotides is the nucleus . Ligands known to 
aid in this process are nuclear localization peptides, or 
proteins containing these nuclear localization sequences. 
C. Dingwall et al. r TIBS (1991) 16:47S-481. 
5 Y. Kaneda et al., Science (1989) 243:375-378, showed 

that the transfection efficiency obtained with 
reconstituted Viral envelopes is increased when the foreign 
gene is co-delivered into the target cells with nuclear 
proteins. DNA mixed with nuclear proteins exhibit a modest 
10 increase in transfection over DNA that was mixed with 
albumin (Kaneda et al., supra ) . The assumption is that the 
DNA is incorporated into the nucleus more readily when 
proteins containing the nuclear localization sequence (NLS) 
pro-lys-lys-lys-arg-lys-val (P. A. Silver, Cell (1991) 
15 64:48 9-497) are associated with the plasmid. The NLS on a 
protein designates it for transport through the nuclear 
pore- Nuclear localization sequences of 14 amino acids 
have been attached to a variety of macromolecules and even 
gold particles (150 A diameter) and, when introduced into 
20 the cytoplasm, they are rapidly incorporated into the 
nucleus (D.R. Findlay et al- , J Cell Sci Supp (1989) 
11:225-242; Silver, supra ) . The suggestion that nuclear 
entry is rate limiting for successful, stable transfection 
is also supported by the finding that plasmid DNA 
25 micro in j ected into the cytoplasm is unable to bring about 
transfection of cells (no transfection out of 1000 
cytoplasmic injections, whereas microinjection of plasmids 
directly into the nucleus results in transfection in 
greater than 50% of the micro injected cells. Cappechi, 
3 0 supra . If the attachment of nuclear localization signals 
on the plasmid leads to transport of plasmid DNA into the 
nucleus, the transfection efficiency should increase. We 
propose a novel method to attach NLS and other ligands to 
the desired polynucleotide. 
35 * Finally r investigators have demonstrated that 

transfection efficiencies increase when DNA is condensed 
using various cat ionic proteins- T.I. Tikchonenko et al . , 
Gene (1988) 63„:321-330 ; M. Bottger et al . , Biochim Biophvs 
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Acta (1988) 950:221-228; Wagner et al . , supra . The reason 
why DNA condensation increases transfection is not readily 
apparent, it may increase cellular uptake of DNA (Wagner et 
al., supra ) but it also can decrease susceptibility of the 
5 DNA to nuclease activity which may result in higher amounts 
of intact DNA in the cell . 

Polynucleotide Association . Direct delivery of genes 
associated with one of the above-discussed classes of 
molecules, is further aided by the ability of those 

10 components to remain associated with the DNA. Wu et al . , 
supra , associated their receptor ligand with the 
polynucleotide by covalently attaching the ligand to the 
polycation polylysine. Wagner et al., supra , in addition 
to polylysine, also covalently attached the ligand to a DNA 

15 intercalator, ethidium homodimer (5 , 5' -diazadecamethylene- 
bis (3 , 8-diamino-6 -phenylphenanthridium) dichloride 
dihydrochloride) . P.E. Nielsen, Eur J Biochem (1982) 
122.: 283 -289, associated photoaf f inity labels to DNA by 
covalent attachment to 9-aminoacridine and certain bis- 

2 0 acridities . 

None of the above references describe a system for 
polynucleotide delivery aimed at multiple aspects of the 
problems involved in bringing a circulating polynucleotide 
to a targeted subcellular location of a targeted cell. 
25 This invention addresses those problems by associating the 
polynucleotide 'with a combination of one or more of the 
following functional components: DNA-masking components, 
cell recognition components, charge-neutralization and 
membrane-permeabilization components, and subcellular 

3 0 localization components. 

Summary of the Invention 

In light of the aforementioned problems of direct gene 
delivery, this invention contemplates a self -assembling 
polynucleotide delivery system utilizing a combination of 
3 5 one or more, preferably two or more of the following 
functional components: DNA-masking components, cell 
recognition components , charge-neutralization and membrane - 
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permeabilization components, and subcellular localization 
components. Each component in this system is able to 
perform its indicated function and also be capable of 
assembling or disassembling with the polynucleotide as 
5 required. For example, certain components may have to 
dissociate from the polynucleotide in order for it to 
perform its desired function. 

It., is accordingly a primary obj ect of this invention 
to provide a composition for presenting a polynucleotide to 

10 a subcellular component of a eukaryotic cell comprising the 
polynucleotide associated with a membrane-permeabilizing 
component capable of transporting the polynucleotide across 
the cytoplasmic membrane of the eukaryotic cell . 

It is another object of this invention to provide a 

15 composition for presenting a polynucleotide to the nucleus 
of a eukaryotic cell comprising the polynucleotide 
associated with a cell recognition component capable of 
recognizing the eukaryotic cell. 

It is yet another object of this invention to provide 

20 a composition for presenting a polynucleotide to the 
nucleus of a eukaryotic cell comprising the polynucleotide 
associated with both a cell recognition component capable 
of recognizing the eukaryotic cell, and a membrane- 
permeabilizing component capable of transporting the 

25 polynucleotide across the cytoplasmic membrane of the 
eukaryotic cell. 

It is a further object of this invention to provide a 
composition for presenting a polynucleotide to a 
subcellular component of a eukaryotic cell comprising the 

3 0 polynucleotide associated with a subcellular- localization 
component capable of delivering the polynucleotide from the 
cytoplasm of the eukaryotic cell to a subcellular component 
of the eukaryotic cell. 

It is still a further object of this invention to 

35 ' provide a composition for presenting a polynucleotide to a 
subcellular component of a eukaryotic cell comprising the 
polynucleotide, a cell recognition component capable of 
recognizing said eukaryotic cell, a membrane-permeabilizing 
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component capable of transporting the polynucleotide across 
the cytoplasmic membrane of said eukaryotic cell, a 
subcellular- localization component capable of delivering 
the polynucleotide from the cytoplasm of said eukaryotic 
5 cell to a subcellular component of said eukaryotic cell, 
and a masking component capable of increasing the 
circulatory half- life of the polynucleotide. 

It is another object of this invention to provide a 
component useful in self -assembling polynucleotide delivery 
10 systems having the formula 

X-Y 
I 

(CH 2 ) p 

15 C=0 

I 

Ar,-MH- (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 

wherein each of n and m is independently an integer of 1 
to 20, p is an integer of 0 to 20, Ar x and Ar 2 are 

20 independently selected from the group consisting of 
ethidium bromide, acridine, mitoxantrone, oxazolopyrido- 
carbazole, ellipticine and N-methyl-2 , 7-diazapyrenium, and 
derivatives thereof, X is a reactive coupling group, and Y 
is selected from the group consisting of cell surface 

25 receptor ligands, subcellular localization sequences, and 
membrane permeabilizing components . 

It is still another object of this invention to provide 
a reactive intercalating component having the formula 

X 

30 | 

(CH 2 ) p 
I 

c=o 

I 

35 Ar,-NH- (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 

wherein each of n and m is independently an integer of 1 to 
20, p is an integer of 0 to 20, Ar, and Ar 2 are 
independently selected from the group consisting of 
ethidium bromide, acridine, mitoxantrone, oxazolopyrido- 
40 carbazole, ellipticine and N-methyl-2 , 7-diazapyrenium, and 
derivatives thereof, and X is a reactive group. 
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Brief Description of the Figures 

Figure 1 shows one embodiment of the polynucleotide 
delivery system of the invention r where NLS is a nuclear 
localization sequence, MD is a membran-permeabilization 
5 component, and Ligand is a cell recognition component. 

Figure 2 shows the structure of gramicidin 

Figure 3 compares the efficiency of luciferase 
transfection with Lipof ectin™, pH-sensitive liposomes, and 
the gramicidin S /DOPE /DNA complex. 
10 Figure 4 shows the effect of gramicidin S to DNA ratio 

on transfection efficiency. 

Figure 5 shows the effect of gramicidin S to DOPE ratio 
on transfection efficiency. 

Figure 6 shows the effect of lipid type in the 
15 gramicidin S/lipid/DNA complex on transfection efficiency. 

Figure 7 shows the effect of substituting other 
peptides for gramicidin S in the gramicidin S/lipid/DNA 
complex on transfection efficiency. 

Figure 8 shows a synthetic scheme for attaching 
20 targeting carbohydrates and/or reactive maleimide to 
spermidine bis - acridine . 

Figure 9 shows the basic scheme for coupling peptides 
to the maleimido- spermidine bis -acridine. 

Figure 10 shows a synthetic scheme for coupling to a 
25 degradable Lys-Lys peptide bis -acridine. 

Figure 11 shows the results of the gel retardation 
assay of Example 3 . 

Figure 12 shows the ability of several galactosyl bis- 
acridines to bring plasmid DNA into hepatocytes. 
3 0 Figure 13 shows a synthetic scheme for the 

trigalactosylated spermidine bis -acridine of Example 6. 

Detailed Description of the Invention 
Definitions : 

The term "polynucleotide" as used herein, includes RNA 
3 5 or DNA sequences of more than one nucleotide in either 
single chain r duplex or multiple chain form. "Poly- 
nucleotide" is generic to polydeoxyribonucleotides 
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(containing 2 ' -deoxy-D-ribose or modified forms thereof) , 
i.€., DNA, to polyribonucleotides (containing D-ribose or 
modified forms thereof), i.e., RNA, and to any other type 
of polynucleotide which is an N-glycoside or C-glycoside of 
5 a purine or pyrimidine base, or modified purine or 
pyrimidine base or abasic nucleotides. The polynucleotide 
may encode promoter regions r operator regions, structural 
regions, termination regions, combinations thereof or any 
other genetically relevant material. 

10 The polynucleotides of the invention may also contain 

one or more "substitute" linkages as is generally 
understood in the art. Some of these substitute linkages 
are non-polar and contribute to the desired ability of the 
polynucleotide to diffuse across membranes. Others 

15 contribute to the increased or decreased biodegradability 
of the polynucleotide. (Biodegradability will be affected, 
for example, by increased or decreased nuclease 
sensitivity.) These "substitute" linkages are defined 
herein as conventional alternative linkages such as 

20 phosphorothioate or phosphoramidate, are synthesized as 
described in the generally available literature. Not all 
such linkages in the same polynucleotide need be identical. 

Modifications in the sugar moiety of the 
polynucleotide, for example, wherein one or more of the 

25 hydroxyl groups are replaced with halogen, aliphatic 
groups, or f unctionalized as ethers, amines, and the like, 
or wherein the ribose or deoxyribose is replaced with other 
functionally equivalent structures, are also included. 
Modifications in the base moiety include alkylated purines 

30 or pyritnidines , acylated purines or pyrimidines, or other 
heterocycles . Such "analogous purines" and "analogous 
pyrimidines" are those generally known in the art, many of 
which are used as chemb therapeutic agents. 

In particular, the sugar-phosphate backbone of the 

3 5 polynucleotide may be replaced with a non- carbohydrate 
backbone such as a peptide or other type of polymer 
backbone as discussed in P.E. Nielsen et al . , Science 
(1991) 254 :1497-1500. 
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The term "functional component" as used herein, 
includes DNA-masking components , cell recognition 
components, charge-neutralization and membrane- 
permeabilization components, and subcellular- localization 
5 components . 

The term "DNA-masking component", as used herein, 
refers to a molecule capable of masking all or part of the 
polynucleotide, thereby increasing its circulatory half- 
life by inhibiting attack by degrading reagents (such as 
10 nucleases) present in circulation. 

The term " membrane -permeabilizing component" , as used 
herein, refers to any component that aids in the passage of 
a polynucleotide across a membrane . Thus , this term 
encompasses in part charge-neutralizing components , usually 
15 polycations, that neutralize the large negative charge on 
polynucleotides, and enable the polynucleotide to 
transverse the hydrophobic interior of a membrane. Many 
charge-neutralizing components can act as membrane- 
permeabilizers * Membrane -permeabilizat ion may also arise 
20 from amphipathic molecules. 

A membrane permeabilizer is a molecule that can assist 
a normally impermeable molecule to traverse cellular 
membranes and gain entrance to the cytoplasm of the cell. 
A membrane permeabilizer may be a peptide, bile salt, 
25 glycolipid, carbohydrate, phospholipid or detergent 
molecule. .Membrane permeabilizers often have amphipathic 
properties such that one portion is hydrophobic and another 
is hydrophilic, permitting them to interact with membranes. 
The term "liposome" as used herein, refers to small 
3 0 vesicles composed of amphipathic lipids arranged in 
spherical bilayers. Liposomes are usually classified as 
small unilamellar vesicles (SUV) , large unilamellar 
vesicles CLUV) , or mult i- lamellar vesicles (MLV) . SUVs and 
w LUVs, by definition, have only one bilayer, whereas MLVs 
3 5 * contain many concentric bilayers. Liposomes may be used to 
encapsulate various materials, by trapping hydrophilic 
molecules in the aqueous interior or between bilayers, or 
by trapping hydrophobic molecules within the bilayer. 
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Liposomes exhibit a wide variety of characteristics, 
depending upon their size, composition, and charge. For 
example, liposomes having a small percentage of unsaturated 
lipids tend to be slightly more permeable, while liposomes 
5 incorporating cholesterol or other sterols tend to be more 
rigid and less permeable. Liposomes may be positive, 
negative, or neutral in charge, depending on the 
hydrophilic group. For example, choline^based lipids 
impart an overall neutral charge, phosphate and sulfate 

10 based lipids contribute a negative charge, glycerol -based 
lipids are generally negatively- charged, and sterols are 
generally neutral in solution but have charged groups. 

The term "cell recognition component" as used herein, 
refers to a molecule capable of recognizing a component on 

15 the surface of a targeted cell. Cell recognition 
components include: antibodies to cell surface antigens, 
ligands for cell surface receptors including those involved 
in receptor-mediated endocytosis, peptide hormones, and the 
like. 

20 The term "DNA- associating moiety" refers to a molecule 

or portions thereof that interacts in a noncovalent fashion 
with nucleic acids. DNA-associating moieties include 
major- and minor-groove binders, which are molecules 
thought to interact with DNA by associating with the major 

25 or minor groove of double -stranded DNA. DNA associating 
moieties also include DNA intercalators, which are planar 
molecules or planar portions of molecules thought to 
intercalate into DNA by inserting between and parallel to 
nucleotide base pairs. DNA associating moieties further 

3 0 include polycations, thought to associate with the negative 
charges on the DNA backbone. When a single -stranded DNA or 
RNA is used as the therapeutic strand, the complementary 
"linker strand" as described herein may functionally act as 
the "DNA-associating moiety". 

35 DNA associating moieties may be covalently linked 

through a "reactive group" to a functional component of 
this invention. These reactive groups are easily reacted 
with a nucleophile on the functional component. Such 
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reactive groups (with their corresponding reactive 
nucleophiles) include, but are not limited to: N- 
hydroxysuccinimide (amine) , maleimide and maleimidophenyl 
(sulfhydryl) , pyridyl disulfide (sulfhydryl) , hydrazide 
5 (carbohydrate) , and phenylglyoxal (arginine) . 

The term "subcellular- localization component" as used 
herein,, refers to a molecule capable of recognizing a 
subcellular component in a targeted cell- Recognized 
subcellular components include the nucleus, ribosomes, 
10 mitochondria, and chloroplasts . Particular subcellular - 
localization components include the "nuclear-localization 
components" that aid in bringing molecules into the nucleus 
and are known to include the nuclear localization peptides 
and amino acid sequences . 

15 The Compositions: 

The compositions of this invention in part are self- 
assembling polynucleotide delivery systems utilizing a 
polynucleotide in combination with one or more, preferably 
two or more of the following functional components: DNA- 

20 masking components, cell recognition components, charge- 
neutralization and membrane -permeabilization components , 
and subcellular localization components. Each element in 
this system is able to perform its indicated function and 
also be capable of assembling or disassembling with the 

25 polynucleotide as required. Individual elements of this 
system, and methods and intermediates for making these 
elements are also contemplated as part of this invention. 
One embodiment of the system is shown in Figure 1. 

The Polynucleotide 

3 0 The polynucleotide may be single -stranded DNA or RNA, 

or double- stranded DNA or DNA-RNA hybrid. Triple- or 
quadruple-stranded polynucleotides with therapeutic value 
are also contemplated to be within the scope of this 
invention. Examples of double- stranded DNA would include 

35 structural genes, genes including operator control and 
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termination regions, and self -replicating systems such as 
plasmid DNA. 

Single-stranded polynucleotides include antisense 
polynucleotides (DNA and RNA) , ribozymes and triplex - 
5 forming oligonucleotides. This "therapeutic strand", in 
order to have prolonged activity, preferably has as some or 
all of the nucleotide linkages stable, non-phosphodiester 
linkages. Such linkages include, for example, the 
phosphorothioate, phosphorodithioate, phosphoroselenate, or 

10 O-alkyl phosphotriester linkages wherein the alkyl group is 
methyl or ethyl. 

For these single -stranded polynucleotides, it may be 
preferable to prepare the complementary strand to the 
therapeutic strand as part of the administered composition. 

15 This complementary strand is designated the "linker 
strand 1 ' , and is usually synthesized with a phosphodiester 
linkage so that it is degraded after entering the cell. 
The "linker strand" may be a separate strand, or it may be 
covalently attached to or a mere extension of the 

20 therapeutic strand so that the therapeutic strand 
essentially doubles back and hybridizes to itself. 

The linker strand may also have functionalities on the 
3 ' or 5 ' end or on the carbohydrate or backbone of the 
linker that serve as functional components to enhance the 

25 activity of the therapeutic strand. For example, the 
phosphodiester linker strand may contain a targeting ligand 
such as a folate derivative that permits recognition and 
internalization into the target cells. If the linker is 
attached to its complementary therapeutic strand that is 

30 composed of degradation-resistant linkages, the duplex 
would be internalized. Once inside the cell, the linker 
would be degraded, releasing the therapeutic strand. In 
this manner the therapeutic strand would have no additional 
f unctionalites attached and its function would not be 

35 impeded by non-essential moieties. This strategy could be 
applied to any antisense, ribozyme or triplex- forming 
polynucleotide. It would be used to deliver antiviral, 
antibacterial , antineoplastic , antiinflammatory, 
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antiproliferative, anti -receptor blocking or anti -transport 
polynucleotides and the like. 

A separate "linker strand" may be synthesized to have 
the direct complementary sequence to the therapeutic strand 
5 and hybridize in a one-on-one fashion. Alternatively, the 
linker strand may be constructed so that the 5' region of 
the linker strand hybridizes to the 5' region of. the 
therapeutic strand, and the 3' region of the linker strand 
hybridizes to the 3' region of the therapeutic strand to 
10 form a concatenate of the following structure 

5' 

3' 

This concatenate has the advantage that the apparent 
molecular weight of the therapeutic nucleic acids is 
15 increased and its pharmacokinetic properties and targeting 
ligand: therapeutic ligand ratio can be adjusted to achieve 
the optimal therapeutic effect. 

The Functional Components 

DNA-Maskincr Components . The DNA-masking element of 

20 this system is a molecule capable of masking all or part of 
the polynucleotide r thereby increasing its circulatory 
half -life by inhibiting attack by degrading reagents 
present in circulation. 

In this invention, polyethylene glycol (PEG) can be 

25 covalently linked with a DNA-associating moiety by 
conventional methods as described below, and used as a DNA- 
masking component. The PEG will have a molecular weight 
from about 700 to about 20 r 000 daltons, preferably about 
1800 to 6000 daltons, and is preferably present in a ratio 

3 0 (molecules PEG : bp DNA) from about 1:4 to 1:100, more 
preferably about 1:20. 

Alternatively r DNA may be masked through association 
with lipids. In one embodiment, the DNA is encased in 
standard liposomes as described, for example, in U.S. 

35 Patent No. 4,394,448 to Szoka et al., the specification of 
which is hereby incorporated by reference. In another 
embodiment, the DNA is incubated with a synthetic cationic 
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lipid similar to those described in U.S. Patent No. 
4,897,355 to Eppstein et al . These cationic lipids have 
the general formula 

.R'o-ch,. >R , 

R^O-CH — tCU^ „, - 

wherein n is an integer from 1 to 8 , R 1 and R 2 are the same 
5 or different and are alkyl or alkenyl having from 6 to 24 
carbon atoms, R 3 is hydrogen, alkyl or alkylamine having 
from 1 to 10 carbon atoms, and R 4 is a positively charged 
linear or branched alkyl or alkylamine having from 1 to 3 0 
carbon atoms, wherein one or more of the carbon atoms may 

10 be substituted with NR' , wherein R' is hydrogen, alkyl or 
alkylamine having from 1 to 10 carbons. Preferred groups 
that can function as the -N-R' moiety are 
tris (aminoethyl) amine (NH 2 CH 2 GH 2 ) 3 N, agmatine 

(decarboxyarginine ) ( CH 2 ) 4 C ( =NH) NH 2 , 3 - aminoethyl -1,3- 

15 propanediamine H 2 N (CH 2 ) 3 NH (CH 2 ) 2 NH 2t 

3 - dimethyl am inopropylamine ( CH 3 ) 2 NH ( CH 2 ) 3 NH 2 , 
iminobis (N , N ' ) dime thylpropyl amine NH ( ( CHj) 3 N ( CH 3 ) 2 ) 2 , 
iminobis (3-aminopropyl) -1, 3 -propanediamine, 1,4 -bis (3- 
aminopropyl ) piperazine , bis (propylamine ) (NH 2 ( CH 2 ) 3 ) 2 NH , 

2 0 spermidine, and spermine, wherein these groups are attached 
to the lipid molecule through one of their nitrogen atoms. 

In a specifically preferred embodiment, the 
synthetic cationic lipid is a synthetic cationic tail lipid 
having the formula 

Z -C-O -CH ? 

o 

25 wherein n is an integer from 6 to 24, Y is selected from 
the group consisting of hydrogen, ethanolamine, choline, 
glycerol, serine and inositol, R 1 is alkyl or alkenyl 
having from 6 to 24 carbon atoms, R 3 is hydrogen, alkyl or 
alkylamine having from l to 10 carbon atoms, and R 4 is a 
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positively charged linear or branched alkyl or alkylamine 
having from 1 to 3 0 carbon atoms, wherein one or more of 
the carbon atoms may be substituted with NR' , wherein R' is 
hydrogen, alkyl or alkylamine having from 1 to 10 carbons. 
5 Preferred groups that can function as the -N-R' moiety are 
tris (amino ethyl) amine (NH 2 CH 2 CH 2 ) 3 N , agmatine 
(decarboxyarginine) ,H 2 N (CH 2 ) 4 C (=NH) NH 2 , 3 -aminoethyl-1 , 3 - 
propanediamine K^tf ( CH 2 ) 3 NH ( CH 2 ) 2 NH 2 

3 - dime thy 1 am in op ropy 1 ami ne ( CH 3 ) 2 NH ( CH 2 ) 3 NH 2 , 

1 0 iminobis (N , N' ) dimethylpropylamine NH ( ( CH 2 ) 3 N (CH 3 ) 2 ) 2 , 

iminobis (3-aminopropyl) -1, 3-propanediamine, 1, 4-bis (3- 
aminopropyl) piperazine, bis (propylamine) (NH 2 (CH 2 > 3 ) ^NH, 
spermidine, and spermine, wherein these groups are attached 
to the lipid molecule through one of their nitrogen atoms. 

15 It has been found that the above-described synthetic 

cationic lipids effectively mask the DNA when associated 
therewith. Without attempting to limit the invention in 
any way, it is believed that the lipids form a monolayer 
structure that encapsulates the DNA in some fashion. 

20 Cell Recognition Components , The cell recognition 

element of this system is a molecule capable of recognizing 
a component on the surface of a targeted cell, covalently 
linked with a DNA-associating moiety by conventional 
methods as described below. Cell recognition components 

25 include: antibodies to cell surface antigens, ligands for 
cell surface receptors including those involved in 
receptor-mediated endocytosis, peptide hormones, etc. 
Specific ligands contemplated by this invention include: 
carbohydrate ligands such as galactose, mannose, mannosyl 

30 5-phosphate, fucose, sialic groups, N-acetylglucosamine or 
combinations of these groups as complex carbohydrates such 
as those found on glycolipids of the blood groups or on 
various secreted proteins. Other ligands include folate, 
bio tin, various peptides that can interact with cell 

3 5 surface or intracellular receptors such as the 
chemoattractant peptide N-f ormyl-met-leu-phe, peptides 
containing the arg-asp-glycine sequence or cys-ser-gly-arg- 
glu-asp-val-trp peptides, peptides that contain a cystine 
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residue or that interact with cell surface protein such as 
the human immunodeficiency virus GP-120, and peptides that 
interact with CD-4 . Other ligands include antibodies or 
antibody fragments such as described by A. Hertler and 
5 A. Frankel, J Clin Oncol 7 : 1932-1942. The specif icity of 
the antibodies can be directed against a variety of 
epitopes that can be expressed on cell surfaces including 
histocompatability macromolecules , autoimmune antigens, 
viral, parasitic- or bacterial proteins. Other protein 

10 ligands include hormones such as growth hormone and insulin 
or protein growth factors such as GM-CSF, G-CSF, 
erythropoetin, epidermal growth factor, basic and acidic 
fibroblast growth factor and the like. Other protein 
ligands would include various cytokines that work through 

15 cell surface receptors such as interleukin 2, 
interleukin 1, tumor necrosis factor and suitable peptide 
fragments from such macromolecules . 

Membrane-Permeabilizina Components . The membrane - 
permeabilizing element of this system is a molecule that 

20 aids in the passage of a polynucleotide across a membrane. 

The liposomes and synthetic cationic lipids described above 
as DNA-masking components also may function as membrane- 
permeabilization components. 

The membrane -permeabilizing components of this 

25 invention also include polycations that neutralize the 
large negative charge on polynucleotides . Polycations of 
this invention include polylysine, polyarginine, poly 
(lysine-arginine) and similar polypeptides, and the 
polyamines. Another class of polycations are the cationic 

3 0 bile salts having the following formula: 



HO 
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wherein X and Y are independently H or OH, R 3 is hydrogen, 
alkyl or alkylamine having from 1 to 10 carbon atoms, and 
R* is a positively charged linear or branched alkyl or 
alkylamine having from 1 to 3 0 carbon atoms, wherein one or 
5 more of the carbon atoms may be substituted with NR r , 
wherein R r is hydrogen, alkyl or alkylamine having from 1 
to 10 carbons. Preferred groups that can function as the - 
N-R' moiety are tr is (aminoethyl) amine (NH 2 CH 2 CH 2 ) 3 N", agmatine 
( decarboxyarginine ) Hjtf ( CH 2 ) 4 C C =NH) NH 2 , 3 - aminoethyl -1,3- 

10 propanediamine H2N(CH 2 ) 3 NHCCH 2 ) 2 NH 2 ^ 

3 - dimethyl aminop ropy 1 amine ( CH 3 ) 2 NH ( CH 2 ) 3 NH 2 , 
iminobis (N,N' ) dimethylpropylamine NH ( (CH 2 ) 3 N (CH 3 ) 2 ) 2/ 
iminobis ( 3 - aminopropyl ) - 1 , 3 -propanediamine , 1 , 4 -bi s ( 3 - 
aminopropyl ) piperazine r bis (propylamine } (NH 2 ( CH 2 ) 3 ) 2 NH , 

15 spermidine , and spermine, wherein these groups are attached 
to the bile salt through one of their nitrogen atoms. 

In a different embodiment, the membrane- 
permeabilizing component of the invention is an amphipathic 
cationic peptide. Amphipathic cat ionic peptides are 

20 peptides whose native configuration is such that the 
peptide is considered to have a cationic face and a 
neutral, hydrophobic face. In a preferred embodiment, the 
peptide is a cyclic peptide. Examples of the amphipathic 
cationic cyclic peptides of this invention are gramicidin 

25 S (the structure of which is shown in Figure 2) , and the 
tyrocidines* The peptide may also contain some or all of 
the amino acids in the d configuration as opposed to the 
naturally occurring I* configuration. 

In a particularly preferred embodiment, the membrane- 

3 0 permeabilizing element includes, in addition to the 
amphipathic cationic cyclic peptides, either (1) a lipid, 
or (2) a simple polyamine, or both. 

The lipid of the invention is an amphipathic molecule 
which is capable of liposome formation, and is 

35 substantially non- toxic when administered at the necessary 
concentrations either in native form or as liposomes. 
Suitable lipids generally have a polar or hydrophilic end, 
and a non-polar or hydrophobic end. Suitable lipids 
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include without limitation egg phosphatidylcholine (EPC) , 
phosphatidylethanolamine, dipalmitoylphosphat idyl choline 
(DPPC) , cholesterol (Choi) , cholesterylphosphoryl choline, 
3,6, 9-trioxaoctan-l-ol-cholesteryl-3e-ol, dimyristoyl- 
5 phosphatidylcholine (DMPC) , and other hydroxy- cholesterol 
or aminocholesterol derivatives (see, e.g., K.R. Patel et 
Biochim Biophvs Acta (1.985) 814:256-64) . The lipid is 
preferably added in the form of liposomes. 

The added polyamine is preferably spermine or 
10 spermidine. 

The membrane permeabilizing elements the cyclic 
peptide and optional phospholipid and polyamine may be 
added to the composition simultaneously or consecutively. 
Preferably, the cyclic peptide is added first, and the 
15 phospholipid or polyamine added later. The molar ratio of 
added cyclic peptide to added polyamine is preferably from 
about 1:1 to about 1:3. The molar ratio of added cyclic 
peptide to added phospholipid is preferably from about 1:1 
to about 1:20. 

20 Subcell ular-Localization Components . The 

subcellular- localization element of this system is a 
molecule capable of recognizing a subcellular component in 
a targeted cell, covalently linked with a DNA-associating 
moiety by conventional methods as described below. 

25 Particular subcellular components include the nucleus, 
ribosomes, mitochondria, and chloroplasts . 

In a preferred embodiment of this invent ion , the 
subcellular-localization component is a nuclear- 
localization component . The nuclear-localization 

3 0 components include known peptides of defined amino acid 
sequences, and longer sequences containing these peptides. 
One known peptide sequence is the SV 4 0 large T antigen 
heptapeptide pro-lys-lys-lys-arg-lys-val . Other peptides 
include the influenza virus nucleoprotein decapeptide ala- 

35 ala-phe-glu-asp-leu-arg-val-leu-ser , and the adenovirus Ela 
protein sequence lys-arg-pro-arg-pro . Other sequences may 
be discerned from C. Dingwall et al . , TIBS (1991) 16 :478- 
481. 
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In another embodiment, the subcellular- localization 
component is a lysosomal- localization component. A known 
component for targeting the lysosome is a peptide 
containing the sequence lys-phe-glu-arg-gln. In yet 
5 another embodiment, the subcellular- localization component 
is a mitochondrial-localization component. A known 
component for targeting mitochondria is a peptide 
containing the s equence met - 1 eu - ser- leu - ar g- gin- s er - i le - 
arg-phe-phe - lys -pro - ala- thr- arg . 
10 DNA- Associating- Moieties 

The DNA-associating moiety of this system refers to 
a portion of a functional component that interacts in a 
noncovalent fashion with nucleic acids. The moiety is 
covalently linked to the rest of the functional component 
15 by conventional means or as described below. DNA- 
associating moieties are preferably major- and minor-groove 
binders, DNA intercalators , or general DNA binders. In the 
case of single- stranded polynucleotides, the DNA- 
associating moiety may even be the linker strand as 
20 described above. In such a case the functional moiety r 
such as the cell-recognition or subcellular-localization 
component is covalently linked to the linker strand. 

In one preferred embodiment, the DNA-associating 
moiety is a major- or minor-groove binder. The major- and 
25 minor-groove binders are moieties known to associate or 
"lay in" the major or minor groove of DNA. These binders 
include distamycin A and Hoechst dye 33258. 

In another embodiment, the DNA-associating moiety is 
a nonspecific DNA binder such as a polycation. Polycations 
3 0 of this invention include polylysine, polyarginine, poly 
(lysine-arginine) and- similar polypeptides, and the 
polyamines . 

In another preferred embodiment, the DNA-associating 
moiety is a DNA intercalator . DNA intercalators are planar 
3 5 * polycyclic molecules such as ethidium bromide, acridine, 
mitoxantrone, oxazolopyridocarbazole , ellipticine and N- 
methyl-2, 7-diazapyrenium, and derivatives thereof. In a 
particular preferred embodiment, the intercalator is a 
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dimer consisting of two covaiently linked planar polycyclic 
molecules. A planar polycyclic dimer moiety of this 
invention has the structure 

Z 

5 I 

CCH 2 ) p 

C=0 
I 

10 Ar r NH (CH 2 ) n -N- (CH 2 ) ra -NH-Ar 2 

wherein Z is a bond, each of n and m is independently 

an integer of 1 to 20, and p is an integer of 0 to 20; and 
Ar t and Ar 2 are independently selected from the group 
consisting of ethidium bromide, acridine, mitoxantrone , 

15 oxazolopyridocarbazole, ellipticine and N-methyl-2 , 7- 
diazapyrenium, and derivatives thereof . 

The values of n and m are important as they determine 
the spacing of the intercalated acridine monomers in the 
DNA. More preferred values of n and m are 3 and 4, 

20 respectively. Bis-acridine dimers, wherein Ar, and Ar 2 are 
both acridine, are preferred. 

This preferred DNA-associating moiety will be 
covaiently attached to a functional moiety, said moiety 
being a cell recognition moiety, subcellular localization 

25 moiety, or membrane permeabilizing moiety as described 
above. The value of p determines the separation of the 
intercalator from the functional moiety. Preferred values 
for p are from 0 to 8 . 

The DNA-associating moiety may be covaiently attached 

3 0 to multiple copies of, or more than one functional moiety. 

For example, the bis-acridine dimer may be attached to 
three galactose residues that bind to the hepatocyte 
asialoorosomucoid receptor as shown in Figure 13 . 

A preferred method for attaching the DNA-associating 

3 5 dimer to the functional moiety involves a precursor having 
the formula 
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X 
I 

(CH,) P 
I " 

I 

Ar^NH- (CH 2 > n -N- (CH 2 ) m -NH-Ar 2 

wherein each of n and m is independently an integer of 1 to 

20 r and p is an integer of 0 to 20; 
10 Ar t and Ar 2 are independently selected from the group 

consisting of ethidium bromide, acridine, mitoxantrone , 

oxazolopyridocarbazole, ellipticine and N-methyl-2, 7- 

diazapyrenium , and derivatives thereof; and 

X is a reactive group selected from the group 
15 consisting of N-hydroxysuccinimide , maleimide, 

maleimidophenyl , pyridyl disulfide , hydrazide, and 

phenylglyoxal . 

In a preferred embodiment r Ar Y and Ar 2 are acridine, 

p is 4 and X is p-maleimidophenyl . This intercalating 
20 moiety is then coupled to the functional moiety through a 

sulfhydryl group on the functional moiety, for example, to 

obtain a bifunctional component having the structure 

X-Y 
t 

25 (CH 2 ) p 

C=0 
I 

Ar l -NH- (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 

3 0 wherein Y is a functional component; 

each of n and m is independently an integer of 1 to 
20, and p is an integer of 0 to 20; 

and Ar 2 are independently selected from the group 
consisting of ethidium bromide, acridine, mitoxantrone, 
35 oxazolopyridocarbazole, ellipticine and N-methyl-2 , 7- 
diazapyrenium, and derivatives thereof; and 

X is a reactive group selected from the group 
consisting of N-hydroxysuccinimide, maleimide, 
maleimidophenyl, pyridyl disulfide,, hydrazide, and 

4 0 phenylglyoxal . 
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Biodegradable linkers such as peptides having the 
sequence -lys-lys- may also be used in attaching the 
functional component to the intercalator. 

In yet another embodiment of this invention, the 
5 planar polycyclic dimer has the formula 

(aa) x -aai- (aa) y -aa 2 - (aa) z 

N l H N^H 

10 Ar x Ar 2 

wherein Ar t and Ar 2 are independently selected from the 
group consisting of ethidium bromide, acridine, 
mitoxantrone, oxazolopyridocarbazole, ellipticine and N- 
methyl-2, 7-diazapyrenium, and derivatives thereof; 
i5 each aa is independently an amino acid; 

x and z are integers independently selected from 1 
to 100; 

y is an integer from 0 to 5; 
aa. x and aa 2 are lysine residues ; 
20 N 1 and N 2 are nitrogens from the e- amino groups of 

lysine residues aaj and aa 2 . 

Utility of the Polynucleotide Delivery System 

The polynucleotide delivery system of the invention 
is useful in a therapeutic context. In therapeutic 

25 applications, the system of the invention can be formulated 
for a variety of modes of administration, including 
systemic and topical or localized administration. 
Techniques and formulations generally may be found in 
Remingtons Pharmaceutic al Sciences . Mack Publishing Co., 

3 0 Easton, PA, latest edition. 

For systemic administration, parenteral 
administration such as injection is preferred, including 
intramuscular , intravenous , intraperitoneal , and 
subcutaneous. For treating disorders of the lung, 

35 * administration of the polynucleotide delivery system is 
done by inhalation or installation of the system directly 
into the lung. 
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For injection, the systems of the invention axe 
formulated in liquid solutions, preferably in 
physiologically compatible buffers such as Hank' s solution 
or Ringer's solution. In addition, the systems may be 
5 formulated in solid form and redissolved or suspended 
immediately prior to use. Lyophilized forms are also 
included. 

Systemic administration can also be by transmucosal 
or transdermal means, or the systems can be administered 

10 orally, or through intranasal or inhaled aerosols. For 
transmucosal or transdermal administration, penetrants 
appropriate to the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the 
art, and include, for example, for transmucosal 

15 administration bile salts and f usidic acid derivatives . In 
addition* detergents may be used to facilitate permeation. 
Transmucosal administration may be through nasal sprays, 
for example, or using suppositories. For oral 

administration, the systems are formulated into 

2 0 conventional oral administration forms such as capsules, 

tablets, and tonics. 

For topical administration, the systems of the 
invention are formulated into ointments, salves, gels, or 
creams, as is generally known in the art. 
25 The following examples are meant to illustrate, but 

not to limit the invention. 

Example 1 
Gramicidin S Transfection 
I*ipofectin w is a synthetic cat ionic lipid, 
30 dioleyloxytrimethylammonium, (DOTMA) in combination with 
phosphatidylethanolamine to form a charge complex with the 
negatively charged DNA. This complex is thought to fuse 
with the cell membrane and deliver DNA into the cytoplasm. 
An alternative approach uses pH sensitive liposomes 

3 5 composed of a negatively charged lipid and 

phosphatidylethanolamine. C.Y. Wang et al., Biochem (1989) 
28.: 9508-9514 . The delivery mechanism involves endocytosis 
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of the liposome, as the pH in the endosome becomes acidic, 
the liposomal biiayer destabilizes and fuses with the 
endosomal membrane. The contents of the liposome are then 
introduced into the cytoplasm of the cell. C.-J. Chu et 
5 al., Pharmaceut Res (1990) 7:824-834. 

We have compared Lipofectin™ to a pH- sensitive 
cholesterylhemisuccinate (Chems) /phosphatidylethanolamine 
(PE) liposome composition and to gramicidin S / dioleoyl- 
phosphatidylethanolamine (DOPE) / DNA complexes for the 

10 delivery and expression of DNA in mammalian cells. 

Plasmids containing strong promoters and either firefly 
luciferase or 0 galactosidase were used as indicators for 
gene transfer. 
Cell transfection protocol . 

15 CV-1, p3 8 8Dl, HepG2 and HeLa cells were provided by 

the UCSF Cell Culture Facility. The Lipofectin™ reagent 
was used as described in the product insert (Gibco-BRL, 
Gaithersburg, MD) . KD83 cells were obtained from DNAX 
(Palo Alto r CA) . Cells were plated at a density of 0.5- 

20 1X10 6 cells per 60 mm dish and grown 16 to 20 hrs at 37C 
under 5% C0 2 in appropriate media containing 10% fetal calf 
serum (FCS) . Prior to incubation either with liposomes, 
Lipofectin™, or the gramicidin S/DOPE/DNA complex, cells 
were washed once with 2 ml of FCS-free DME H-21 medium. 

25 The transfection system was then added in 2 ml of the same 
media. In some experiments, transfection took place in 10% 
FCS containing DME H-21. After 5 hrs. media was removed 
and replaced by 3 ml of appropriate media with 10% FCS. 
Luciferase activity was measured after 48 hrs as described 

30 (A.R. Brasier et al . , Biotechnioues (1989) 7:1116-1122). 

Briefly, cells were washed twice with ice-cold phosphate 
buffer saline without Ca 2+ and Mg 2+ (PBS) , treated with 40 0 
111 of 25 mM glycylglycine (pH 7.8) in lysis buffer 
(containing 1% Triton) and scraped. After cent rifugat ion, 

3 5 100 /xl of supernatant were mixed with an optimal amount of 
50 mM ATP. D-luciferin (Sigma, 100 fil of a ImM solution) 
was then injected and the emitted light was integrated 
during the first 10 sec . using a bioluminometer 
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(Bioluminescence Analytical Laboratories Inc. , San Diego, 
CA) . Proteins in the supernatant were assayed using the 
technique of Bradford (Bio-Rad kit) . Results were 
expressed as light units per mg of cell -protein. 
5 Luciferase Assay 

In order to compare the potency of three different 
viral luciferase gene promoters, RSV, SV40 and CMV, we have 
transfected several mammalian cell lines with the 
corresponding Lipof ectin 1 * complexed-plasmids . Each dish of 

10 cells received 2 fil of plasmid combined with 10 ptl of 
Lipof ectin** as described above. Promoter strength was 
estimated by the luciferase expression at 48 hr given by 
the corresponding plasmid- The CMV promoter (pCluc4 
plasmid) led to the highest luciferase expression in HeLa, 

15 HepG2 and p388Dl cells, while SV40 promoter (pSV2 plasmid) 
was more potent in CV-1 cells . Therefore for further 
experiments, pSV2 plasmid has been used in CV-1 cells and 
pCluc4 in other cell-lines. 
Liposome characterization 

20 Plasmid encapsulation efficiency was determined after 

separation of encapsulated from non-encapsulated plasmid on 
Ficoll gradients. About 22 ± 3% of the total DNA added was 
encapsulated. Liposome diameter was measured by dynamic 
light scattering and were 372— ±38 nm, 295±65 nm and 464±20 

25 nm for DOPE/CHEMS, DOPC/CHEMS and PS/Choi liposomes 
respectively (results are the mean±SD of three independent 
light scattering determinations) . 

A. Gramicidin S and Phosp hatidvlethanolamine 

Typical complex preparation was made by diluting 20 

3 0 /ig of plasmid DNA in 300 /il of 30 mM Tris Cl pH 9 in a 
polystyrene tube. Gramicidin S was diluted in 3 0 mM pH 9 
Tris Cl buffer to a concentration of 2 mg/ml from a stock 
solution at 20 mg/ml in DMSO. 20 fil of diluted gramicidin 
S (i.e. 40 fxg) solution was added to the DNA and quickly 

3 5 mixed. Then 170 nmoles of liposomes were added slowly drop 
by drop to the DNA gramicidin S mixture. Liposomes were 
prepared by drying 4 pmoles of lipids under nitrogen with 
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a rotavapor and by rehydrating the film with 4 ml of 3 0 mM 
pH 9 Tris Cl buffer. Liposomes were subsequently sonicated 

3 0 min under argon using a bath sonicator. The diameter of 
the complex was determined by dynamic light scattering. 

5 Other peptides including, tyrocidine (U.S. Biochemicals) , 
polymyxin B (Sigma) and polylysine 100 (Sigma) were also 
used to form the complex with DNA and lipids. 

The efficiency of transfection was monitored by 
measuring the expression of lucif erase in CV-1 cells as 

10 described above. The dose response comparing the amount of 
DNA added in the three transfection systems is illustrated 
in Figure 3 . Light units per mg cell protein in a log 
scale are plotted on the Y axis and the amount of DNA added 
on the X axis. The open box designate results using the 

15 Gramicidin S-dioleoylphosphatidyl ethanolamine-DNA complex . 

This complex induces a 10 fold greater level of expression 
than obtained with Lipofectin, and a 1000 to 10, 000 fold 
greater level of expression than obtained using the pH 
sensitive liposomes . 

20 B. Gramicidin S-DNA Ratio Effects 

The gramicidin S-DOPE-DNA complex was prepared as 
described in Example 1-A except the amount of gramicidin S 
added to the complex was varied at constant amounts of DNA 
(20 ug) and DOPE (170 nmoles) . The complex was added to 

25 CV-l cells and the lucif erase activity measured as 
described in Example 1. The result is presented in Figure 

4 and illustrates that maximum expression using the 
gramicidin S-DOPE-DNA complex occurs when the charge on the 
DNA is neutralized by the charge on the gramicidin. 

3 0 C . Lipid Concentration Effects 

The gramicidin S -DOPE -DNA complex was prepared as 
described in Example 1 except the amount of DOPE added to 
the complex was varied at constant amounts of DNA (20 ug) 
and gramicidin S (40 /xg) . The complex was added to CV-1 

35 * cells and the lucif erase activity measured as described in 
Example 1. The result is presented in Figure 5, which 
illustrates that in the absence of the DOPE, expression is 
low. Maximum expression using the gramicidin S-DOPE-DNA 
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complex occurs when the ratio of DOPE to gramicidin S is 
above 5/l:mole/mole. 

D. Lipid Type Effects 

The gramicidin S-lipid-DNA complex was prepared as 
5 described in Example 1 except the type of phospholipid 
added to the complex was varied at constant amounts of DNA 
(20 ug) and gramicidin S(40 ug) . The lipid compositions 
employed were DOPE; DOPE: dioleoylphosphatidylcholine 
(DOPC) : 2 /l , palmitoyloleoylphosphatidylethanolamine ( POPE ) , 

10 monomethyl DOPE (mmDOPE) ; dimethyl DOPE (dm DOPE) ; DOPC and 
dipalmitoylphosphatidylethanolamine (DPPE) . The complex 
was added to CV-1 cells and the lucif erase activity 
measured as described in Example 1. The result is 
presented in Figure 6, which illustrates that expression of 

15 luciferase activity is maximal with DOPE or a mixture of 
DOPE/DOPC:2/l in the complex. Iiucif erase activity is 
appreciably diminished when the amino group on the DOPE is 
substituted with 2 (dm DOPE) or 3 methyl groups (DOPC) . 
Expression of the encoded gene is also appreciably reduced 

20 when DPPE is used. This latter lipid has saturated acyl 
chains and a high transition temperature, which means the 
acyl chains of DPPE are less fluid than the other lipids 
tested in this series. 

E. Effects of Added Non-Amphipathic Positively Charged 
25 Spermidine 

The data presented in Example 2 show that gene 
expression due to the gramicidin S -DOPE -DNA complex is 
maximal when the negative charges on DNA are neutralized by 
the positive charges on gramicidin. To determine whether 

30 charge neutralization or membrane permeabilization is more 
important for gene transfer using this system, the positive 
charge contribution from gramicidin S was incrementally 
replaced by the positively charged polyamine, spermidine. 
The gramicidin S-lipid-DNA complex was prepared as 

35 described in Example 1 except the amount of gramicidin S 
added to the complex was varied at constant amounts of DNA 
(20 ug) . The requisite positive charges required to 
neutralize the DNA was supplied by spermidine , The complex 
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was prepared with or without 170 nmoles of DOPE. The 
complex was added to CV-1 cells and the lucif erase activity 
measured as described in Example 1. The results are given 
in Table 1 below, with luciferase activity expressed as 
light units/mg cell protein. First activity was always 
greater when DOPE was present in the complex. In the 
absence of DOPE, sequential replacement of positive charge 
due to gramicidin S by spermidine leads to a biphasic 
response; the expression of luciferase initially increase 
to a value about 10 0 fold less than the maximal response 
obtained in the presence of DOPE. When the percent of 
charge neutralization due to gramicidin S dropped below 25% 
transfection activity was totally lost. Thus, membrane 
permeabilization function of gramicidin S is more important 
than the charge neutralization function. 

Table 1 

Spermidine Charge Neutralization 



% charges 




w/o lipids 


with lipids 


brought by GS 




100 




4.5 ± 2 1 03 


8.5 ± 0.7 1 08 


75 




4 ± 2.5 105 


5 ± 2 108 


25 




2 ± 2.5 10 6 


2 ± 0.5 10 7 


12.5 




0 


2 ± 0.5 10 7 


?. Use 


of Other 


Positivelv Charaed 


Peptides 


The 


peptide 


-DOPE-DNA complex 


was prepared as 



described in Example 1 except the type of peptide added to 
the complex was varied at constant amounts of DNA (20 ug) 
and DOPE (170 nmoles) . The peptides employed were 



10 



15 
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polymixin B, a cyclic cationic peptide; polylysine, a 
linear cationic peptide; tyrocidine, a cyclic cationic 
peptide with a similar structure to gramicidin S but 
containing only a single positive charge and gramicidin S. 
5 The lucif erase plasmid was also trans fee ted into the cells 
using Lipofectin. The complex was added to CV-1 cells and 
the lucif erase activity measured as described in Example 1. 
Figure 7 shows that gramicidin S induced the greatest level 
of expression followed closely by the related cyclic 

10 peptide tyrocidine. Both cyclic peptides were superior to 
Lipofectin at transferring the DNA into cells. Activity 
was also seen with the other two peptides , polymixin B and 
polylysine, however, the level of lucif erase expression 
mediated by these two cationic peptides was inferior to 

15 that induced by gramicidin S or tyrocidine. 

G. Comparison of Transfection Mediated bv a DNA- 

Dendrimer Complex to That Obtained with a DNA- Poly lysine 
Complex 

To find better chemically-defined alternatives to the 

20 polyamine polymers such a polylysine, we have employed the 
hydrophyllic branched polycation macromolecules also known 
as the Starburst 7 * Dendrimer microparticles , Tomalia et al . r 
supra , to form a complex with DNA or with DNA and the 
permeabilizing amphipathic peptide GALA. R. Parente et 

25 al., Bichemistrv (1990) 22:8720-8728. The complex was 
prepared by diluting 12 jig of pCLUC4 plasmid in 660 fi± of 
HBS (20 mM Hepes r 150 mM NaCl, pH 7.4) in a polystyrene 
tube- Polylysine (Sigma Chemical Co.) or Starburst™ 
Dendrimer microparticles of the fifth generation (1 nmole) 

30 (Polysciences, Inc.) was dissolved in 340 /ii of HBS and 

added slowly (dropwise) to the DNA solution. In these 
conditions, the positive charges from the epsilon amino 
groups of the polylysines or from the peripheral amines of 
the dendrimer s are in 1.3 -fold excess over the negative 

35 charges of the plasmids. When the peptide GALA wa added, 
it was added so that the negative charges on GALA 
neutralized the excess charges on the dendrimer. The 
mixture was left to stand for thirty minutes after the last 
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addition at room temperature and then 500 fi± of the mixture 
was added to CV-1 cells. The transf ection. protocol was 
carried out as described above. In this experiment, the 
best transf ection protocol was accomplished with the GALA- 
5 dendrimer-DNA complex, followed by the denrimer-DNA and 
then by polylysine-DNA (Table 2) . 



Table 2 

DNA-Dendrimer Mediated Transfection 
Condition Luciferase lights 

10 (units per mg cell protein) 



Dendrimer-GA1*A-DNA (9 ± 2) x 10 s (n = 2) 

Dendrimer-DNA (5 ± 2) x 10 5 (n = 2) 

Polylysine-DNA (2.7 ± 0.1) x 10 s (n = 2) 



Example 2 

15 Synthesis of Reactive and Functionalized 

Spermidine Bis-Acridines 
Spermidine bis-acridine derivatives (synthesis shown 
in Figure 8) intercalate into double stranded nucleic acids 
with affinity constants greater than 1 X 10 4 (pH 7.4; 0.2M 

20 NaCl) and can be used to attach a variety of targeting 
molecules to DNA. Carbohydrates, peptides, hormones, 
vitamins, cof actors , proteins or antibodies can all be used 
as targeting ligands. 
A. Spermidine Bis-Acridine 

25 The scheme for directing nucleic acids to certain 

sites of the body is based upon the intercalation of a 
ligand which interacts with a cell surface componentinto 
the double stranded DNA. A procedure for selective N 4 - 
acylation of spermidine, using N T ,N 8 -bis (t-butoxycarbonyl) 

3 0 spermidine as starting material, has been reported. R.J. 

Bergeron et al.. Synthesis (1982) 689-692. We have used 
this procedure (Figure 8) to link the acid functionalized 
galactosyl derivatives 9 and 9' to the secondary amino 
group of NS^-Boc-protected spermidine (15) and the 

35 resulting galactosylated spermidines 17 and 17' were, after 
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deprotection, further alkylated with 9 -phenoxyacridine by 
a standard chemical procedure to transform them into bis- 
intercalator compounds 21 and 21 r . The synthesis of the 
carboxylic acid functionalized galactosyl derivatives is 
5 detailed (J. Haensler et al., Biochim Biophvs Acta (1988) 
946 x95-105) and is easily applicable to a wide range of 
carbohydrate ligands (M.M. Ponpipom et al., J Med Chem 
(1981) 24 11388-13 95) . The title compounds were obtained in 
an overall yield of 3 0% and the NMR and mass spectrometry 
10 data are consistent with the proposed structure, 

B. Activated Spermidine Bis -Acri dine 

Based upon the above scheme, a versatile method for 
attaching peptides to a spermidine bis-acridine derivative, 
has been developed. N l r N 8 -bis (t-butoxycarbonyl) spermidine 

15 was N 4 -acylated with N-succinimidyl-4 - (p- 
maleimidophenyl)butyrate (SMPB) (4) , deprotected and 
coupled to acridine rings to make a bis-intercalator 
bearing a maleimide group (20) . A single compound was 
obtained after chromatographic purification on silicic acid 

20 in 25% overall yield- The NMR and mass spectrometry 
results axe consistent with the assigned structure. 

C. Spermidine Bis -Acridine Linked to a NLS 

The NLS peptide PKKKRKV (Kaneda et al., supra ) and 
control peptides with the same composition but a different 

25 sequence have been synthesized on an AB1 automatic peptide 
synthesizer with an N- terminal cysteine residue. The 
cysteine peptide is then attached to the maleimide bearing 
intercalator (Figure 9) and can be anchored into double 
stranded nucleic acids. 

3 0 D. Biodegradable Linkers 

Biodegradable linkers consisting of a lys-lys peptide 
linkage are synthesized in the manner shown in Figure 10. 
In the figure r a galactose residue is placed on the 
unprotected amine. Alternatively, a protected peptide 

3 5 containing two adjacent lysine residues is synthesized by 
solid phase synthesis. The peptide carries membrane 
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permeabilization functions or targeting functions and 
acridine residues are added to the two e -amino gropus on 
the lysines . 

Example 3 

5 Gel Retardation Assay of pC LUC4 Plasmid with 

Galactosylated Intercalators and Agglutinin 
To demonstrate that the galactosylated bis-acridines 
21 and 21' of Example 2 (21' is the homolog of 21 where the 
galactose is separated from spermidine bis -acridine by 

10 three extra carbons) can interact with a soluble receptor 
while attached to DNA, we used a gel shift assay. In this 
assay, a galactose binding protein, Ricinus Communis lectin 
RCA 120 , was incubated with the galactosyl-bis-acridine-DNA 
complex. If this protein interacts with the complex and 

15 the complex remains associated with the DNA f the DNA does 
not migrate into the electrophoresis gel. Each sample of 
the plasmid pC LUC4 (2/xl; 140ng) was mixed with 13.5 pmoles 
of 21 or 21' and 1^1 (33.3 pmoles) of RCA^o was added with 
an excess of free galactose (1.35 nmoles) when indicated. 

20 After 30 minutes of incubation at room temperature, the 
samples were electrophoresed through a 0.8% agarose gel 
using a 0.04M Tris -Acetate buffer system (pH 7.6) and 
stained with ethidium bromide to visualize the DNA (Figure 
11) - 

25 Intercalation of the galactosylated spermidine bis- 

acridines into the pC LUC4 plasmid is shown by the 
retardation observed for the plasmid when complexed with 
compounds 21' (lane B) or 21 (lane E) . Intercalation of 
the bis -acridine into the DNA produces a change from the 

3 0 supercoiled form to a relaxed circular form, which migrates 
slower. 

The capability of the plasmid-galactose complex to 
bind to a soluble receptor for galactose is shown by the 
almost complete retardation of the complex in presence of 
35 Ricinus Communis lectin RCA l20 (lane C and F) . RCA 120 is a 
dimer and his two binding sites selective for terminal 
j3-D-galactosyl residues and thus too can crosslink the 
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plasmid-galactose complexes. The interaction of RCA 120 with 
the plasmid pC LUC4 when coraplexed to compounds 21 or 21* 
results in a formation of large aggregates which do not 
penetrate into the gel. This interaction appears to be 
5 much more efficient when the plasmid is complexed with 21' 
than with 21. To crosslink "the plasmids, RCA no has to 
overcome electrostatic repulsions existing between adjacent 
plasmids. Thus, separating the galactose from the surface 
of the. plasmids by a spacer arm, as in case of the 

10 complexes obtained with compound 21 ' , makes the binding of 
the lectin easier. As a result of a multivalent 
interaction, the plasmid aggregates formed by RCA 120 are 
very stable and are not dissociated by a 100 -fold excess of 
a competing monovalent ligand such as galactose (lanes D 

15 and G) . 

Example 4 

Bidding of Bis-acridines to Double-S tranded DNA 
Using Ethidium Bromide Displacement Assay 
The affinity of the bis-acridines for calf thymus DNA 

20 was calculated from the displacement of ethidium bromide 
from double stranded nucleic acids (Nielsen, supra) . 
Ethidium displacement was monitored by the decrease of the 
ethidium bromide fluorescence (ex. = 540 nm, em. = 610 nm) 
that occurs when it is released from DNA. The association 

25 constants of the bis-acridines relative to ethidium bromide 
are calculated from their IC 50 . In this study, spermidine 
bis-acridine trihydrochloride (SBA-3HC1) synthesized as 
described (Nielsen, supra ) , was used as the reference 
compound. As a result of the loss of one of its three 

30 positive charges, a slight but significant decrease in 
affinity is observed when the N 4 amino group of spermidine 
bis-acridine is engaged in an amide bond with the targeting 
carbohydrate in compound 21 (Gal-BA- 2HC1) - However, we 
predict an increase in affinity when spermidine bis- 

35 acridine is linked to the highly positively charged NLS 
peptide PKKKRKV . G. Karup et al., Xnt J Peptide Protein 
Res (1988) 32:331-343. 
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conjugates synthesized to attach targeting ligands to DNA 
in the various examples are given in Table 3. 

Table 3 

Biggociafcion Constants of f-he Bi s-acridines 
5 from Calf Thymus nWA. (in M) 

SBA-3HC1 2.4 X 10" 8 

Gal-3-bA 1 3.5 X 10' 7 

Gal-6-bA 2 7.9 X 10- 7 

Gal 3 Lys 2 -bA 3 5.4 X 10" 6 

10 Maleimido-bA 4 6.5 X 10' 7 

WTcys-bA 5 1.4 X lO' 7 

SNL-bA 6 1-4 X 10- 7 



'Compound 21 where n=3 
15 Compound 21 where n=6 

'Compound 26 (shown in Example 6) 
4 Compound 20 

5 SBA linked to CGYGPKKKRKVGG 
6 SBA linked to CGYKPKVRGKGKG 
20 The binding constants for the various bis-acridines are 
computed from an ethidium bromide displacement assay by 
using a method to determine the binding affinity of a 4-Mer 
for a linear lattice via noncooperative competitive binding 
with a 2-Mer (A. Wolfe and T. Meehan. J. Mol . Biol. 223, 
25 1063-1087, 1992) and an intrinsic dissociation constant of 
5.3 X 10"*M for ethidium bromide. 

Rxamole 5 

ahi 1 i r,v of Bi s- acrid in* Oalart.nsvl Ligands to 
Tar-get DNA to Cell-su rface Receptors 
3 0 to demonstrate the factors that control targeting 

ability of the bis-acridine intercalators containing a 
galactosyl targeting ligand, rat hepatocytes were isolated 
from rat liver and placed in culture at a density of 10 6 
hepatocytes in 60 mm petri dishes in 3 ml of MEM medium 
35 containing 5% fetal calf serum and antibiotics. The 
hepatocytes are shown to have galactose receptors by 
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hepatocytes are shown to have galactose receptors by 
binding asialorosomucoid- After 18 hours at 37°C, the 
medium is removed and replaced with 1 ml of MEM- Then 1 ug 
of 125 I- labeled plasmid DNA complexed to either BA-3HCL, 
5 Gal-3-bA, Gal-6-bA or Gal 3 -:Lys 2 -bA in 100 ul water was added 
to the culture dish. The intercalator to plasmid ratio was 
500:1 or 1000 rl. The cells were incubated for an 
additional hour at 31 x C r then rinsed and the protein 
digested in 1 ml NaOH (IN) . The cell lysate was counted 

10 for radioactivity and the protein measured. The amount of 
cell associated plasmid is expressed as ng of plasmid per 
mg of cell protein and graphed as a function of complexing 
agent (Figure 12) . Although all three galactosyl bis 
acridine compounds bind to DNA (Table 2) and can interact 

15 with a soluble galactose binding protein (Example 3) , only 
the'Gal 3 -Lys 2 -bA was able to interact with the cell surface 
receptor. Thus, efficient targeting to cell surface 
receptors requires a longer spacer arm as provided by the 
Gal 3 -Lys 2 -bA was able to interact with the cell surface 

20 receptor (synthesis shown in Figure 13 and Example 6) . 

Example 6 

Synthesis of a Biodegradable Bis-acridine 

Containing Three Targeting- Liaands : 
Trigalactosvlated Spermidine Bis -acridine 
The complete synthesis of this molecule is shown in Figure 
13. 

Synthesis of I*-Lysyl-]> Lysine bis-trif luoroacetate(22) : 
N- € -BOC-Ii-Lysine (603 mg, 2.45 mmol) and N-N-€ -bis-BOC-L- 
Lysine-p-nitrophenyl ester (2.28 g; 4.9 mmol) were mixed in 
40 ml of N-methyl morpholine containing 640 /il of N,N 
dllsopropylethylamine (3.7 mmol) - The mixture was stirred 
overnight at room temperature under argon filtered to 
remove insoluble traces of unreacted N-e-BOC-L-Lysine and 
evaporated to dryness under high vacuum. The residue was. 
purified in a silica gel column eluted with the system 
CHCl 3 /CH 3 OH/H 2 0 9:1:0.1 to afford 1.22 g of pur BOC- 
protected Lysine dimer; yield 87% 

38 
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Deprotection: To a cooled flask (dry ice) containing 
700 mg (1.2 mmol) of the BOC-protected Lysine dimer were 
added 5 ml of TFA. The mixture was warmed to room 
temperature and stirred under argon. After 3 0 minutes 
5 stirring the trif luoroacetic acid was evaporated in vacuo. 
The residue was taken up in acetone and evaporated (5 
times) • Finally the residue was redissolved in 14 ml of 
water, extracted three times with 8 ml of chloroform and 
lyophylized to give 480 mg of the tittle compound; yield 
10 80%. 

Protected trigalactosyl lysine dimer: Synthesis of 
Na- [Na, Ne-Bis (6- (l-thio-2 , 3 , 4 , 6 , -tetra-O-acetyl-0-D- 
galactopyranosyDhexanoyl] -L-Lysyl-Ne- (6 -l-thio-2 , 3 , 4 , 6 , - 
tetra-0-acetyl-/3-D-galactopyranosyl) hexanoyl] -L-Lysine 
15 (23) . 

To a solution of L-Lysyl -L-Lysine bis - trif luoroacetate 
(400 mg; 0.8 mmol) in a 8 ml of anhydrous DMF containing 
505 of triethylamine (3.6 mmol) was added p-nitrophenyl 
6 - (l-thio-2, 3,4,6, - tetra-O-acetyl-0-D- 

20 galactopyranosyl) hexanoate (1.44g; 2.4 mmol) . The mixture 
is stirred overnight under argon and evaporated to dryness. 
The residue was purified by chromatography on a silica gel 
column eluted with CHC13/CH30H/H2) 90:10:0.5 to give 463 mg 
of the title compound; yield 35%. 

25 MS: Calculated for C 13 K U2 ^0 32 S 3 m/z = 1652, found m/z = 
1653.6 (M+H)+, m/z= 1677.6 (M+Na)+, m/z = 1693.6 (M+K) + 

Reaction with selectively blocked spermidine: 
N4- [Nor- [Nor, Ne-Bis (6- (l-thio-2 ,3,4,6, - tetra-O-acetyl- 0-D- 
galactopyranosyl) hexanoyl] -L-Lysyl-Ne- (6-l-thio-2 , 3 , 4 , 6 , - 
30 tetra-0-acetyl-/3-D-galactopyranosyl) hexanoyl] -L-Lysyl] - 
Nl, N8 -bis -BOC- spermidine. (24) . 

Compound 23 (132 mg; 8 0 /xmol) was activated by 
ester if ication with N-Hydroxysuccinimide (11 mg, 9 6 /xmol) 
in the presence of DCC (20 mg; 97 jimol) in 5 ml of 
35 anhydrous methylene chloride. After 4 h stirring at room 
temperature under argon, the urea precipitate was removed 
by filtration and the filtrate was evaporated in vacuo. 
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The dry residue was redissolved in 3 ml of acetonitrile and 
added dropwise to a solution of Nl,N8-bis (t-butoxycarbonyl 
spermidine) Hydrochloride (30 mg; 8 0 /xmol) in 3 ml of 
acetonitrile containing 14 pel of triethylamine (100 /xl) ^ 
5 The mixture was further stirred for 48 h at room 
temperature under argon , evaporated in vacuo to a residue 
which was purified in a silica gel column eluted with 
CHC1 3 /CH30H/H 2 0 90:10:1 to afford 71 mg of the title 
compound; yield 45% . 

10 Deprotection: Synthesis of N4- [Na- [Na f Ne-Bis (6- (1- 

thio-/3-D-galactopyranosyl) hexanoyl] -L-Lysyl-Ne- (6- (l-thio- 
0-D-galactopyranosyl) hexanoyl] -L-Lysyl] spermidine. (25) 
Compound 24 (71 mg; 36 /xmol) was deprotected as described 
previously for compound 11. The BOC protecting groups were 

15 removed from the spermidine linker by treating with .5 ml of 
TFA for 3 0 min and the acetyl protecting groups were 
removed from the galactosyl headgroups by treating 
overnight with a mixture of CH 3 OH/NEt 3 /H 2 0 5:4^1. The bis- 
trifluoroacetate salt of the spermidine derivative was 

20 converted to the free amine by passing a water solution of 
the salt through a small BIO-RAD AG 1X2 (OH*) column. The 
fractions positive for carbohydrates and for amines were 
pooled together and lyophylized. 
36 mg of compound 25 was obtained; yield 79%. 

25 Acridine attachment: Synthesis of N4- [Na- [Na,Ne-Bis (6- (1- 
thio-/3-D-galactopyranosyl) hexanoyl] -L-Lysyl-Ne- (6- (1-thio- 
j3-D-galctopyranosyl) hexanoyl] -L-Lysyl] -N1,N8 -bis -acridine 
spermidine .(26) ( " Gal 3 -Lys 2 -bA" ) 

Compound 25 (36 mg; 28.5 jimol) and 18 mg of 9- 
30 phenoxyacridine were dissolved in 3g of phenol at 80° C and 
the solution was further stirred for 2 h at 80 °C under 
argon. The mixture was then cooled to about 40 °C and 
poured into 15 ml of ether to precipitate the 
aminoacridines . The yellow precipitate was collected by 
35 filtration on a filter paper and redissolved in 4 ml of a 
but anol /methanol mixture 3:1. This solution was then 
condensed by evaporation to about 1 ml and the bis -acridine 
derivative was isolated by chromatography on a silica gel 
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6:2:1:2. 

34 mg of the title compound are obtained; yield 21%. 

MS: Calculated fox C 8l H 117 N 9 O 20 S 3 m/z = 1631, found m/z = 

1632.8 <M+H)+, m/z = 1654.8 (M+Na) + . 

5 Example 7 

Transfection Assay Using Nuclear Localization Sequences 
5 /il of TE containing a trace amount of a 5Kb 
radioiodinated plasmid (CMV-/?Gal) and 50 /xl of water 
containing 8 nmoles of the nuclear localization peptide - 
10 bis-acridine conjugate of Example 2-C were added to 80 fiq 
of pCLUC4 (123 neq. bp) in solution in 45^1 of TE buffer 
(pH 8) . The ratio of plasmid to peptide conjugate was 
1:300. After 1 hour standing at room temperature 100 fil of 
Tris-Cl buffer (pH 9) was added to the complex and the ? 
15 resulting solution was mixed with 12 jimoles of lipids 
(DOPE/CHEMS 2:1, molar ratio) dissolved in 600 fil of ether 
for the preparation of pH-sensitive liposomes. 

The vesicles containing the DNA-peptide complexes 
were separated from nonencapsulated material by floating 
20 the liposomes through a ficoll gradient ♦ Encapsulation 
efficiency (20% ± 4%) was determinated by dynamic light 
scattering (Coulter N4, Coultronics) . 

Cells were transfected with 4 fil of liposome - 
encapsulated plasmid (100 fxl of the liposome solution) for 
25 5 hours at 37°C and luciferase activity was counted after 
4 8 hours in a bioluminometer . Table 4 shows the measured 
light units/mg cell protein as a function of the liposomal 
content. The values are the averages of three 

determinations . 
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Table 4 
Liposomal Content: 

plasmid alone plasmid-WTcvs-bA complex. plasmid- SKL-bA 

complex 

5 (0,32 ± 0. 02) 10 6 ( 0 . 82 ± 0.36) 10* (1.3S ± 0 .28) 10 6 . 



Positive control: Iiipof ectih™ = (1.4 ± 0.2) 10 8 . 

If we admit the pH-sensitive liposomes deliver their 
content into the cytoplasm of the host cell, the naked 
10 plasmid must be able to penetrate the nucleus. 

If we exclude that the peptide-bis-acridines 
conjugates protect DNA from degradation , the observed 
transfection enhancement must be a result of an increased 
nuclear entry. The 4-5 fold increase of transfection 
15 agrees with published results (Kaneda et al . , supra ) using 
proteins that bind to DNA to enhance DNA entry into the 
nucleus. Both the SNL peptide and WTcys peptide increase 
expression and are a convenient technique to target DNA 
into the nucleus. 

Example 8 
Synthesis of Cationic Bile Salts 
A. Preparation of the a-cho lic Acid Amide of ar-benvzlester 
6 - TBOC- amino - lys ine 

The synthesis is based upon that of S. Bergstrom et 
al.. Acta Chem Scand (1953) 7:1126\ 204 mg (0.500 
millimoles) of cholic acid was weighed into a screw-capped 
test-tube, and 2.5 ml dioxane and 70 microliters (0.500 
millimoles) of triethylamine was added to the tube. The 
mixture was cooled in an ice bath until the solution 
solidified (at about 12°C) . SS microliters (0.500 

millimoles) of isobutyl chlorof ormate were added, the 
reaction tube was agitated and returned to the ice bath. 
The tube was alternatively removed and replaced in the bath 
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to keep the temperature at the point of incipient 
solidification for 3 0 minutes. 

a-benyzlester N-e-TBOC lysine (0.500 millimoles) and 
70 microliters (0.500 millimoles) of triethylamine were 
5 suspended in 0 . 6 ml of water. The mixture was cooled in 
the ice bath, added to the dioxane reaction mixture, and 
the container rinsed into the reaction mixture with another 
0.5 ml of ice water. The tube stood in the ice bath for 
1/2 hour and then permitted to warm to room temperature. 
10 Most of the organic solvent was evaporated beneath a 

stream of argon gas, and the residue was brought up to 3 ml 
with water. 5% aqueous sodium carbonate was added in a 
dropwise fashion until the pH reached 9. The mixture was 
extracted with three successive 3 ml portions of ethyl 
15 ether, and the aqueous phase saved. 

To the aqueous residue, 0.5 N hydrochloric acid was 
added until the pH fell to 4. The mixture was extracted 
with three successive 3 ml portions of ethyl ether, and the 
aqueous phase saved. 

The pH of the aqueous residue was readjusted to 4 with 
0.5 N hydrochloric acid and extracted into five successive 
3 ml portions of ethylacetate . These ethylacetate extracts 
were combined and evaporated to dryness under vacuum to 
obtain 284 mg of colorless powder melting. The tBoc 
protecting group for the e -amine was removed by standard 
methods to yield the positively charged lysine derivative 
of cholic acid. In a similar fashion other positively 
charged derivatives of cholic acid can be prepared. 

B - Preparation of cho lic acid amide of tris 

3 0 ( 2 - aminoethvl ) amine 

When multiple amine groups are available for coupling 
to the activated cholic acid the amine is added in a 6 fold 
excess over the activated bile salt prepared as described 
in Example 8 -A. The synthesis is based upon that of 
35 Bergstrom et al . , supra. Weigh 204 mg (0 .500 millimoles). 

of cholic acid into a screw-capped test tube. Add 2 . 5 ml 
dioxane and 70 microliters (0.500 millimoles) of triethyl 
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amine- Cool in an ice bath until the solution commences to 
solidify (at about 12 X C). Add 65 microliters (0.500 
millimoles) of isobutyl chloro formate and after agitating, 
return the reaction tube to the ice bath.. By alternately 
5 removing from the ice bath and replacing in the bath, keep 
the temperature at the point of incipient solidification 
for 3 0 minutes. 

Add (3.00 millimoles) of tris (2-aminoethyl) amine and 
70 microliters (0.500 millimoles) of triethylamine in 0.6 
10 ml of water. Cool in the ice bath, add to the dioxane 
reaction mixture, and rinse the container into the reaction 
mixture with another 0.5 ml of ice water. Let stand in the 
ice bath for 1/2 hour and then permit to warm to room 
temperature . 

15 Evaporate most of the organic solvent beneath a stream 

of argon gas. Make the residue back up to 3 ml with water. 
Add 5% aqueous sodium carbonate dropwise until the pH 
reaches .7- Extract with three successive 3 ml portions of 
ethyl ether, saving the aqueous phase. 

20 To the aqueous residue add 0.5 N hydrochloric acid 

until the pH falls to 4. Extract with three successive 
3 ml portions of ethylether, saving the aqueous phase. 

Readjust the pH of the aqueous residue to 4 with 0.5 
N hydrochloric acid and extract into five successive 3 ml 

25 portions of ethylacetate . Combine these ethyl acetate 
extracts and evaporate to dryness under vacuum to obtain 
the cholic acid amide of tris (aminoethyl) amine. 

Example 9 

Synthesis of Polvethvlenecrlvcol-Bis Acridine 
3 0 The synthesis of PEG-coupled bis -acridine spermidine 

follows standard chemistry and involves the following 
steps : 

R'O- (CH2-CH2-0) n -H Activating agent > 

R' O (CH2-CH2-0) n -R* bis -acridine > 

35 R' O- (CH2-CH2-0) n -R-Bis acridine where R' = H or CH 3 and R* 
= activating group and n = 10-250, preferably 20-60. 

There are many methods for preparing activated 
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monomethoxy PEG molecules or activated PEG molecules . A 
preferred method has been described by D. Larwood and F. 
Szoka, J Labelled Comp & Radiopharm (1984) 21:603-614. 
Polyethylene glycol 1900 carbonyl- imidazole methyl ether 
5 was prepared by taking 530 mg (0,28 mmol) dry PEG 1900 
monomethyl ether in 2 ml dry methylene chloride and adding 
78 mg (0.46 mmol) carbonyl di imidazole and 10 mg (0.11 
mmole) imidazole (sodium salt) . After stirring overnight, 
6 ml dry methylene chloride was added and the mixture 

10 extracted with 3.75 ml water, then dried with anhydrous 
sodium sulfate. After filtration, solvent was removed, 
with quantitative yield. Alternatively, the solvent was 
removed, and the resulting oil recrystallized from 
chlorof orm/diethyl ether at -20°C. The resulting imidazole 

15 carbamate white crystals were filtered through a chilled 
funnel, rinsed with a small amount of diethyl ether, and 
used immediately. 

The imidazole carbamate (O.lmM) is added to 0.125 mM 
of N,N' -bis- (9-acridinyl) -4-aza-l , 8-diaminooctane ("bis 

20 acridine - spermidine " , prepared as described by P. Nielsen, 
Eur. J. Biochem. 122:283-289, 1992), dissolved in phenol 
and the reaction run at 8 0°C under argon for 2 hr. The 
mixture is taken to dryness and the yellow product washed 
with cold ethanol and then diethyl ether. The PEG is 

2 5 coupled via a carbamate linkage to the secondary amine of 

the bis -acridine spermidine to form the monomethoxy PEG-bis 
acridine spermidine and is soluble in water. 

In a similar fashion the non-blocked PEG (molecular 
weight 6000) , is activated as above to form the bis- 

3 0 imidazole carbamate PEG. The bis -imidazole carabamate PEG 

is reacted with a 2.5 fold excess of bis-acridine 
spermidine to form the bis (bis-acridine-spermidine) -PEG 
6000. 

Various types of activators for PEG and monomethoxy 
3 5 * PEG have been described in U.S. patent 5,013,55 6 to Woodle 
et al. These methods can be used to generate reactive PEGs 
that can be attached to the bis-acridine molecule via a 
variety of chemistries. For instance a sulfhydryl 
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containing monomethoxy-PEG can be attached to the 
maleimide- containing bis -acridine of Example 2-B. 

Example 10 
DNA-Masking vrith PEG-bis -acridine 
5 PEG molecules can be used to mask the surface of the 

DNA and permit the DNA to circulate for a longer period. 
Radio -iodinated plasmid DNA is mixed with monomethoxy-PEG- 
1900 -bis -acridine spermidine as synthesized in Example 9 at 
a 20 bp DNA-to-1 PEG molecule ratio, for 3 0 minutes at room 

10 temperature. An aliquot of the complex, 5 ug DNA in 0.2 ml 
PBS r is injected via the tail vein into each of a group of 
12 mice. Mice are sacrificed at various periods after 
injection. The blood and other organs are removed and the 
radioactivity associated with each organ is determined. 

15 DNA which has not been complexed to the monomethoxy-PEG- 
bis -acridine -spermidine is injected into a second group of 
mice, designated the control mice. Evidence showing that 
after 10 minutes , 15% of the radioactive plasmid DNA 
remains in the blood in the control mice, whereas in the 

20 monomethoxy PEG-bis acridine spermidine DNA group 
significantly greater levels of the radiolabeled plasmid^ 
PEG complex remain in circulation indicates a pronounced 
masking effect of the DNA molecule by the PEG bis acridine 
spermidine . 

25 Example 11 

Synthesis of a Lecithin Acvl Amine Masking Reagent 
The synthesis of polynucleotide masking lipids is 
accomplished by standard chemistry such as that described 
in C. Pidgeon et al . , Anal Biochem (1989) 176:36-47. 

3 0 1. 1, 12-dodecanedicarboxylic acid + DCC 

THF, 25°C > 

dodecanedicarboxylic acid anhydride (cyclic anhydride) 
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2. monoacyl lysolecithin + cyclic anhydride 

CHC1 3/ DMAP 25°C 48 hr > 

Lecithin - COOH 

3. Lecithin- COOH + carbonyldiimidazole 

CHC1 3 , 25 Q C, 2 hr > 

Lecithin imidazolide 

4 . Lecithin imidazolide + amine reactant 

CHCI3, 25°C, 24 hr > 

cationic lecithin 



10 The final reaction of the amine reactant with the 

lecithin imidazolide is undertaken immediately after 
formation of the lecithin imidazolide. The lecithin 
imidazolide (0.1 mM) is added in to a solution of the amine 
(0.7 mM) in chloroform. Suitable amines for this coupling 

15 are listed in the specification. 

After two hours at room temperature the reaction 
mixture is added to a two fold volume of water /methanol and 
the pH is adjusted to 10. The lecithin linked amine is 
extracted into the organic phase. The organic phase is 

20 then washed with 0.1 M sodium chloride and the organic 
phase taken to dryness. The resulting acyl amine lecithin 
is used to mask the surface of the polynucleotide. Various 
lysolecithin molecules can be used to prepare the lecithin- 
COOH, including dodecyl, myristoyl r palmitoyl, oleyl or 

25 phytanyl, or stearyl. Other headgroups such as 

ethanolamine or phosphatidic acid can be substituted for 
lecithin if they are suitably protected in the activation 
steps and deprotected at the end of the reaction. 

3 0 Example 12 

DNA-Masking with Lecithin Acvl Amine 
The lecithin acyl amine of Example 11 can be added to 
DNA from an ethanol solution at a ratio of 1 positive 
charge to each phosphate group on the DNA. The molecule 
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can be used to mask the surface of the DNA and permit the 
DNA to circulate for a longer period. An aliquot of the 
complex, 5 ug DNA in 0.2 ml PBS , is injected via the tail 
vein into each of a group of 12 mice. Mice are sacrificed 
5 at various periods after injection. The blood and other 
organs are removed and the radioactivity associated with 
each organ is determined. DNA which has not been complexed 
to the lecithin acyl amine is injected into a second group 
of mice, designated the control mice. Evidence showing 

10 that after 10 minutes, 15%- of the radioactive plasmid DNA 
remains in the blood in the control mice, whereas in the 
monomethoxy PEG-bis acridine spermidine DNA group 
significantly greater levels of the radiolabeled plasmid- 
PEG complex remain in circulation indicates a pronounced 

15 masking effect of the DNA molecule by the lecithin acyl 
amine . 
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CLAIMS 

We claim: 

1. A composition for presenting a polynucleotide to 
a subcellular component of a eukaryotic cell, said 
5 composition comprising the polynucleotide associated with 
a membrane -permeabilizing component capable of transporting 
the polynucleotide across the cytoplasmic membrane of said 
eukaryotic cell. 



2 . The composition of claim 1 wherein said membrane- 
10 permeabilizing component is an amphipathic cationic 
peptide. 



3 . The composition of claim 2 wherein said peptide is 
a cyclic peptide. 

4 . The composition of claim 3 wherein said cyclic 
15 peptide is selected from the group consisting of gramicidin 
S and tyrocidines. 



5 . The composition of claim 4 wherein said cyclic 
peptide is gramicidin S. 



6 . The composition of claim 1 further comprising a 
2 0 phospholipid . 



7. The composition of claim 6 wherein said 
phospholipid is a phosphatidylethanolamine . 

8 . The composition of claim 7 wherein said 
phosphatidyl ethanolamine is dioleoyl phosphatidyl - 

2 5 ethanolamine . 



9 . The composition of claim 6 wherein said 
phospholipid is in the form of liposomes. 

10. The composition of claim 1 further comprising a 
polycation. 
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11- The composition of claim 10 wherein said 
polycation is a polyamine . 

12. The- composition of claim 11 wherein said 
polyamine is selected from the group consisting of 
5 spermine, spermidine, 3,3' -diamino-bispropylamine, 
iminobis (N,N) -dimethylpropylamine, iminobis (3 -aminopropyl) - 
1 , 3 -propanediamine , and cationic dendrimers. 

13- The composition of claim 6 further comprising a 
polycation. 

10 14 . The composition of claim 1 wherein said membrane - 

permeabilizing component is a cationic bile salt having the 
formula 



HO 




wherein X and Y are independently H or OH; 
15 R 3 "is hydrogen, alkyl or alkylamine having from 

1 to 10 carbon atoms; and 

R 4 is a positively charged linear or branched 

alkyl or alkylamine having from 1 to 3 0 carbon atoms, 

wherein one or more of the carbon atoms may be substituted 
20 with NR' , wherein R' is hydrogen, alkyl or alkylamine 

having from 1 to 10 carbons . 

15 . A composition for presenting a polynucleotide to 
a subcellular component: of a eukaryotic cell comprising the 
polynucleotide associated with a cell recognition component 
25 capable of recognizing said eukaryotic cell . 
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16. The composition of claim 15 wherein said cell 
recognition component comprises a ligand for a cell -surface 
receptor on said eukaryotic cell, said ligand being coupled 
to a DNA- associating moiety. 

5 17. The composition of claim 16 wherein said DNA- 

associating moiety is an intercalating agent. 

18. The composition of claim 16 wherein said DMA- 
associating moiety is a major- or minor-groove binder. 

19. The composition of claim 17 wherein said 
10 intercalating agent has the formula 

Z 

(CH 2 ) p 
c=o 

Ar 2 -NH (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 

wherein 

Z is a bond; 

each of n and m is independently an integer of l 
to 20, and p is an integer of 0 to 20; and 

Ar, and Ar 2 are independently selected from the 
group consisting of ethidium bromide, acridine, 
mitoxantrone, oxazolopyridocarbazole, ellipticine and N- 
25 methyl -2, 7-diazapyrenium, and derivatives thereof. 

20. The composition of claim 19 wherein said 
intercalating agent is coupled to a plurality of ligands . 

21. The trigalactosylated spermidine bis-acridine 
compound (26) as shown in Figure 13. 



15 



20 



30 



22. The composition of claim 15 further comprising a 
membrane -permeabilization component . 
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23 . A composition for presenting a polynucleotide to 
a subcellular component of a eukaryotic cell comprising the 
polynucleotide associated with a subcellular-localization 
component capable of delivering the polynucleotide from the 
5 cytoplasm of said eukaryotic cell to a subcellular 
component of said eukaryotic cell . 

24. The composition of claim 23 wherein said 
subcellular component is the nucleus , and said subcellular- 
localization component is a nuclear localization component . 

10 25. The composition of claim 24 wherein said nuclear- 

localization component comprises a nuclear localization 
sequence coupled to a DNA- associating moiety. 

26. The composition of claim 25 wherein said DNA- 
associating moiety is an intercalating agent. 

15 27. The composition of claim 26 wherein said 

intercalating agent has the formula 

Z 
I 

(CH 2 ) p 

20 I 

I 

Ar 2 -NH (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 

wherein 
25 Z is a bond; 

each of n and m is independently an integer of 1 
to 20 r and p is an integer of 0 to 20; and 

Ar t and Ar 2 are independently selected from the 
group consisting of ethidium bromide, acridine, 
3 0 mitoxantrone r oxazolopyridocarbazole, ellipticine and N- 
methyl-2,?-diazapyrenium, and derivatives thereof. 



52 



WO 93/19768 



PCT/US93/03406 



28. A DNA-masking component having the formula 

o 

R 4/ N -J- < CH 2 ) n -C-0-CH-CH 2 -O-P-Y 



wherein n is an integer from 6 to 24; 

Y is selected from the group consisting of 
hydroxy, ethanolamine , choline, glycerol, serine and 
5 inositol; 

R l is alkyl or alkenyl having from 6 to 24 carbon 

atoms ; 

R 3 is hydrogen, alkyl or alkylamine having from 
1 to 10 carbon atoms; and 
10 R 4 is a positively charged linear or branched 

alkyl or alkylamine having from 1 to 3 0 carbon atoms, 
wherein one or more of the carbon atoms may be substituted 
with NR' , wherein R' is hydrogen, alkyl or alkylamine 
having from 1 to 10 carbons. 

15 29. A composition for presenting a polynucleotide to 

a subcellular component of a eukaryotic cell comprising 

(a) the polynucleotide; 

(b) a cell recognition component capable of 
recognizing said eukaryotic cell; 

2 0 (c) a membrane-permeabilizing component capable of 

transporting the polynucleotide across the cytoplasmic 
membrane of said eukaryotic cell; and 

(d) a nuclear- localization component capable of 
delivering the polynucleotide from the cytoplasm of said 

2 5 eukaryotic cell to a subcellular component of said 
eukaryotic cell. 

30. The composition of claim 29 further comprising: 

(e) a masking component capable of increasing the 
circulatory half -life of the polynucleotide. 
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31. The composition of claim 3 0 wherein said masking 
component is polyethylene glycol (PEG) covalently linked to 
a DNA-associating moiety . 

32. The composition of claim 30 wherein one or more 
5 of the cell -recognition component , the membrane- 

permeabilizing component, the subcellular-localization 
component and the masking component are capable of 
dissociation from the polynucleotide. 

33. An intercalating compound having the formula 

10 X 

I 

(CH 2 ) p 

15 I 

Ar,-NH- (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 

wherein 

each of n and m is independently an integer of 1 to 
20, and p is an integer of 0 to 20; 

20 Ar x and Ar 2 are independently selected from the group 

consisting of ethidium bromide, acridine, mitoxantrone , 
oxazdlopyridocarbazole, ellipticine and N-methyl-2 , 7- 
diazapyrenium, and derivatives thereof; and 

X is a reactive group selected from the group 

25 consisting of N-hydroxysuccinimide , maleimide, 
maleimidopheny 1 , pyr idyl disulf ide r hydraz ide , and 
phenylglyoxal . 

34. -The intercalating compound of claim 33 wherein 
Arj and Ar 2 are acridines. 

3 0 35. The intercalating compound of claim 34 wherein X 

is maleimidophenyl . 
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36. The compound 



10 



15 



20 




37. A compound having the formula 



x-y 
I 

(CH 2 ) p 

c=o 

Ar,-NH- (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 

wherein 

each of n and m is independently an integer of l to 
20, and p is an integer of 0 to 20; 

Ar, and Ar 2 are independently selected from the group 
consisting of ethidium bromide, acridine, mitoxantrone, 
oxazolopyridocarbazole, ellipticine and N-methyl-2 , 7- 
diazapyrenium, and derivatives thereof; 

X is a reactive group selected from the group 
consisting of N-hydroxysuccinimide , maleimide, 
maleimidophenyl, pyridyl disulfide, hydrazide, and 
phenylglyoxal ; and 

Y is selected from the group consisting of cell 
surface receptor ligands, nuclear localization sequences, 
and membrane permeabilizing components. 



38. The compound of claim 37 wherein X is 
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39 . A method for making the compound having the 
formula 

X-S-Y 
I 

5 (CH 2 ) p 

C=0 
t 

Ar^NH- (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 

10 wherein 

each of n and m is independently an integer of 1 to 
20 r and p is an integer of 0 to 20; 
X is maleimidophenyl ; 

Ar! and Ar 2 are independently selected from the group 
15 consisting of ethidium bromide, acridine, mitoxantrone, 
oxazolopyridocarbazole, ellipticine and N-methyl-2, 7- 
diazapyrenium, and derivatives thereof; and 

Y is selected from the group consisting of cell 
surface receptor ligands, subcellular localization 
20 sequences, and membrane permeabilizing components, 

said method comprising reacting a component having the 
formula 

X 
I 

25 (CH 2 ) p 

I 

C=0 
I 

Ar l -NH- (CH 2 ) n -N- (CH 2 ) m -NH-Ar 2 
3 0 with a free sulfhydryl group on compound Y. 

40 . A method for introducing polynucleotides into 
cells in vitro comprising contacting said cells with the 
composition of claim 29 . 

41. A method for introducing polynucleotides into 
35 cells in vivo comprising contacting said cells with the 

composition of claim 29 . 



56 



WO 93/19768 



PCT/US93/03406 



42. A method for introducing polynucleotides into 
plant cells comprising contacting said cells with the 
composition of claim 29 . 

43. A method for introducing polynucleotides into 
5 mammalian cells comprising contacting said cells with the 

composition of claim 29. 

44. A method for introducing polynucleotides into the 
lung of a mammal comprising administering an aerosol 
composition including the composition of claim 29 to the 

10 lung of the mammal. 

45. A method for gene therapy in a human patient 
comprising administering to said patient a composition of 
claim 29. 

46. The composition of claim 16 wherein said DNA- 
15 associating moiety is a linker strand. 

47. The composition of claim 16 wherein said DNA- 
associating moiety is a dendrimer polycation. 

48. The composition of claim 1 wherein said membrane- 
permeabilizing component is an amphipathic peptide such as 

20 GALA. 



57 



WO 93/19768 



PCT/US93/03406 




MD NLS 



Fig. 1 



1/13 



PCT/US93/03406 



MM, JMCH^ 
NM -CH-=°-MH <H 



CO ' 4 CO 

aH I. -Pro *~""\ 

co. " 5 /» 

, CH 1 (CM t ), 
CH<CH 3 ), MH, 



Fig. 2 



2/13 



WO 93/19768 



PCT/US93/03406 




Fig. 3 

3/13 



WO 93/19768 



PCI7US93/03406 



i 




' ' « ' I 1 I ' I ' I i ' | 

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 
GRAMICIDIN S / DNA charge ratio in complex 



Fig. 4 

4/13 



WO 93/19768 



PCT/US93/03406 




Fig. 5 

5/13 



WO 93/19768 



PCT/US93/03406 




Fig. 6 

6/13 



WO 93/19768 



PCT/US93/03406 




Fig. 7 

7/13 



WO 93/19768 



PCT/US93/03406 




WO 93/19768 



PCT/US93/03406 




9/13 



WO 93/19768 



PCI7US93/03406 



NH-BOC 



HOOC v 

HNBOC pH^^NBOC HNBDC 

\ /^"^2 N N-mat hyl moipho line. DIEA 

CB2N^ C ^ 0C> O- N0 2 
H O 



H 2 Pd/C 
CH 2 CI 2 , EtOH 



CBzN XH - C0 ^COOH 
H vH 



H 9 N 



CH-CQNHCOO- 



NH-BOC 



HOPH 
OH 



NH 2 



HNBOC 



AcO PAc 



NO, 



DEPROTECTION 



NH-BOC 



/ 
CONH 



CH-CONHCOO 




AcO xOAc CH-CONHCOO- 
AcO J — Wrr^^cONH 



NH 2 

PHENOXYACR1DINE 



OAc 



NH-BOC- 



HO PH 

HO^S^S^O 

OH H u 
HN H 

HN— Ar N 

O COOH 



fig. 10 





10/13 



WO 93/19768 



PCT/US93/03406 




fig.u 

11/13 



PCT/US93/03406 




■ bAJ3HCI 

0 GaI-3-bA 

m Gal-6-bA 

□ GaI3Lys2-bA 



500 



1000 



Bis-acridine/PIasmid ratio 



FIGURE 12 

12/13 



WO 93/19768 



PCT/US93/03406 



o 



HN 



^ Q HOOC N 




• 3 COO- H 3 N+. 



\ f~ 2 N-met hyl morpholin a, DlEA 
o f ( r=\ r o 



TFA 



CF 3 COO- H 3 



FIGURE 13 



Vo A N' CH " C °\ - COOH 
' H CH 



HN 
> 
O 



AcO . OAc 
OAc 



-Y 



AcO 



AcO^Ac 

AcO^as 
OAc 



OAc 




AcO .OAc 
KcOAZ^, 




o=< 



^ ✓ 

■to 



Sv^^^^CONH 
"OAc 



AcO > OAc 
OAc 



AcO^T^S 
OAc 

A 

CH-CONH JL > — MC Y <~ " 

OAc 

BOC-NH NK- NH-BOC 

>0 THF AcO ^ OAc 



NH 1)NHS.DCC 

Acetonitrile 

AcO , OAc 

AcO. 



HN 



AcO 



„CONH 



A- 



AcO 



,CONH 



CH-CONH COO- 
/ \ ' 

CONH CH 



CONH 




2A 




1) TFA 

2) CH30H/NE13/H20 

3) AG 1X2 (OH-) 



NH, 



HO .OH 
OH 



,CONI I 



HN 



HO OH 
10 



y - CH-CONH SI . 

^r^S^^^CONH CH 



OH 



CH-CONH JL . 
/ \ -^N 

CONH CH 



OH 



HO .OH 

Ho'si^S^^^CONH 
OH 

2_5 





HN 



CONH 



2 6 

Gal3Lys2-bA 



INTERNATIONAL SEARCH REPORT 



international application No. 
PCT/US93/03406 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(5) : Please See Extra Sheet. 

US CL :435/172.3; 514/44; 536/23.1; 540/1; 544/1, 7; 546/1; 800/205 
According to International Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

U.S. : 435/172.3; 514/44; 536/23.1; 540/1; 544/1, 7; 546/1; 800/205 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Please See Extra Sheet. 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


Y 


JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, Volume 
112, issued 1990, G. Caminati et ah, "Photophysical Investigation 
of Starburst Dendrimers and Their Interactions with Anionic and 
Cationic Surfactants", pages 8515-8522, see entire document. 


1-20, 22-27, 29- 
32, 39-48 


Y 


PHARMACEUTICAL RESEARCH, Volume 9, number 10, issued 
1992, J-Y Legendre et al., "Delivery of Plasmid DNA into 
Mammalian Cell Lines Using pH-Sensitive Liposomes: Comparison 
with Cationic Liposomes", pages 1235-1242, see entire article. 


1-20, 22-27, 29- 
32, 39-48 


pX Further documents are listed in the continuation of Box C. [^jj See patent family annex. 



to be put of particular r 
earlier document published on or after the tacrnarional fifing c 




which may throw doubta on pnoray c t aon ^i) or 
' c publication data of enotb 
(■a specified) 



which it 



donanml pMbliabed prior to the 



later document puhtkhed alter the uternetional filing date c 
date aod not a conflict wfth the application but cited to understood the 
principle or theory underlying the invennoo 



document of particular relevance; the claimed 
considered novel or cannot be considered to involve an 
when the document 



document of particular relevance: the claimed 
considered to involve an inventive step 
combined with one or more other such d< 
benuj obvious to a person skilled in the art 

family 



Iha document 



Date of the actual completion of the international search 
14 JULY 1993 


Date of mailing of the international search report 

" C AUG 1993 


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. NOT APPLICABLE 


Telephone No. (703) 308-0196 



Form PCT/IS A/210 (second sheet)(Juiy 1992)* 



INTERNATIONAL SEARCH REPORT 


International application No. I 
PCT/US93/03406 1 


C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


Y 


WO, A, 91/04753 (BAER ET AL.) 18 APRIL 1991, see entire 
patent application. 


1-20, 22-27, 29- 
32, 39-48 


Y 


WO, A, 90/09786 (SMITH ET AL.) 07 SEPTEMBER 1990, see 
entire application. 


1-48 


T 


WO, A, 93/05162 (EPAND ET AL.) 18 MARCH 1993, see 
entire application. 


1-48 


T 


PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCE, USA, Volume 90, issued February 1993, Legendre et 
aL, "Cyclic amphipathic peptide-DNA complexes mediate high- 
efficiency transfection of adherent mammalian cells", pages 893- 
897, see entire document. 


1-20, 22-27, 29- 
32, 39-48 


Y 


BIOCHIMICA ET BIOPHYSICA ACTA, Volume 817, issued 
1985, C.J. O'Conner et aL, "Bile salt damage of egg 
phosphatidylcholine liposomes", pages 95-102, see entire 
document. 


1-20, 22-27, 29- 
32, 39-48 


Y 


PLANT CELL TISSUE AND ORGAN CULTURE, Volume 22, 
issued 1990, Z. Zhu et aL, "Transformation of tobacco protoplasts 
with DNA entrapped in pH-sensitive liposomes", pages 135-145, 
see entire document. 


1-20, 22-27, 29- 
32, 39-48 


Y 


WO, A, 91/07947 (FREY ET AL.) 13 JUNE 1991, see entire 
patent application. 


1-48 


P,Y 


WO, A, 92/22635 (WU ET AL.) 23 DECEMBER 1992, see 
entire patent application. 


1-48 


Y 


WO, A, 87/02061 (PROTTER ET AL.) 09 APRIL 1987, see 
entire patent application. 


1-48 


T 


BIOCONJUGATE CHEMISTRY, Volume 4, issued 1993, J. 
Haensler et aL, "Synthesis and Characterization of a 
Trigalactosylated Bisacridine Compound To Target DNA to 
Hepatocytes", pages 85-93, see entire document. 


1-48 




US, A, 5166,320, (WU ET AL.) 24 NOVEMBER 1992, see 
entire document. 


1-48 



Form PCT/ISA/210 (continuation of second sheet)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US93/03406 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No 



PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCE, USA, Volume 87, issued June 1990, M. Cotten et aL, 
"Transferrin-polycation-mediated introduction of DNA into human 
leukemic cells: Stimulation by agents that affect the survival of 
transfected DNA or modulate transferrin receptor levels", pages 
4033-4037, see entire document. 

HUMAN GENE THERAPY, Volume 2, issued 1991, F.D. 
Ledley, "Clinical Considerations in the Design of Protocols for 
Somatic Gene Therapy", pages 77-83, see entire document. 

WO, A, 91/15501 (ROSE ET AL.) 17 OCTOBER 1991, see 
entire patent application. 



1-48 



1-48 



1-48 



Form PCT/ISA/210 (continuation of second shcct)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



.^temational application No. 
PCT/US93/03406 



A. CLASSIFICATION OF SUBJECT MATTER: 

IPC (5): A61K 37/00, 37/02. 37/20, 47/00, 47/06; C08H 61/00, 61/12; C08L 65/02; C12N 5/00. 5/06, 5/08, 5/16, 
5/22, 15/00. 15/06, 15/07, 15/11 



B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 
APS, STK-file CA 

Search terms: polynucleo?, DNA, nucleic acid?, oligonucleo?, phosphatidylethanolamine#, dileolyl, gramicidin, 
gramidicin S, tyrocidine, gala, peptide*, registry-2462-63-7, bile salt*, transfect?, deliver?, lipid?, Uposome?, mask?, 
protect?, coat?, animal, vivo, degrad?, cation? dendrimerf, gene, therap?, intercalat? 

BOX n. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

~ I. Claims 1-20, 22*27, 29-32, 40 and 4648, drawn to compositions for presenting polynucleotides to a 

cell, classified in Class 536, subclass 23.1 and the first appearing associated method of introducing 
polynucleotides into cells in vitro, classified in Class 435, subclass 172.3. 

II. Claim 21, drawn to the first appearing organic compound, classified in Class 540, various subclasses 

depending upon the constituents. 

m. Claim 28, drawn to the second appearing organic compound, classified in Class 540, various 
subclasses depending upon the constituents. 

IV. Claims 33-35, drawn to the third appearing organic compound, classified in Class 540, various 
subclasses depending upon the constituents. 

V. Claim 36, drawn to fourth appearing organic compound, classified in Class 540, various subclasses 
depending upon the constituents. 

VI. Claims 37 and 38, drawn to the fifth appearing organic compound, classified in Class 540, various 
subclasses depending upon the constituents. 

VII. Claims 39, 42 and 43, drawn to a method of preparing organic compounds classified in Class 540, 
various subclasses depending upon the constituents. 

Vm. Claims 41-44, drawn to methods of introducing DNA into cell in vivo, classified in 514, subclass 44. 
DC. Claim 45, drawn to gene therapy, classified in Class 514, subclass 44. 



Form PCT/ISA/210 (extra sheet)(July 1992)* 



- > 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US93/03406 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



1. |~j Claims Nos 



j~j Claims Ness,: 



because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out* specifically: 



3. I I Claims Nos.: 

— because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
(Telephone Practice) 
Please See Extra Sheet. 



1. 



As all required additional search fees were timely paid by the applicant, this international search report covers all searchabh 



claims 



2. Q As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite paymenl 
of any additional fee. 

3 . ("] As only some of the required additional search fees were timely paid by the applicant, this international search report coven 
^""^ only those claims for which fees were paid, specifically claims Nos.: 



4. [~ I No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
— restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest The additional search fees were accompanied by the applicant's protest. 

[™] No protest accompanied the payment of additional search fees. 



Form PCT/lSA/210 (continuation of first sheet(l))(July 1992)* 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
I^FADED TEXT OR DRAWING 
I^BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□'LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



■mis 



